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PREFACE 


Algebra is a subject with which we become acquainted during most of 
our education, largely in connection with the solution of equations. Some 
of the most famous questions in mathematical history have involved equa- 
tions with coefficients in Z , the set of integers. This course deals with their 
solutions. We shall see that the process of abstraction enables us to solve 
a variety of problems with economy of effort. This is the principle at the 
heart of abstract algebra, a subject that enables one to deduce sweeping 
conclusions from elementary premises. As such, this course can be used to 
initiate an intelligent student to the glorious world of mathematical discov- 
ery. At the same time, a course in abstract algebra, properly presented, 
could treat mathematics as an art as well as a science. In these notes I have 
tried to present underlying ideas, as well as the results they yield. 

Abstract algebra received a major impetus toward the beginning of this 
century, when “intuition” in geometry began to lead to false assertions. 
It was seen that algebraic structure provides a firm foundation for a new 
subject called algebraic geometry, which enabled Zariski and others to put 
the latest developments of geometry an a solid footing. Indeed, it is not 
surprising that much of the structure theory of algebra was developed by 
Emmy Noether, the daughter of a geometer. Although algebraic geometry 
is outside the scope of this short book, we do attempt to lay the pedagogical 
foundations by introducing Noetherian rings and prime ideals. 

These notes are far from comprehensive, and the serious student might 
continue with the far more thorough texts of Jacobson (Basic Algebra, Free- 
man 1985), Artin (Algebra, Prentice Hall 1991), and/or Cohn (Algebra J, 
Wiley 1974). Throughout, I have tried to stick to the main track of a one- 
year course, with the aim of touching on as many important theorems and 
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applications as possible. The observant reader will notice the heavy debt to 
the classic text of Herstein (Topics in Algebra, Xerox, 1964), which in turn 
owes a great deal to Van der Waerden’s pioneering work. The cursory treat- 
ment here is indicated by the division of the material into chapters, each 
of which is supposed to correspond to one of the 26 weeks of a year-long 
algebra course. (Some of these chapterss are extended a bit by appendices.) 

References are normally within the same chapter, unless indicated other- 
wise by a decimal point; for example, “Theorem 2.15” means “Theorem 15 
of Chapter 2.” 

My personal experience with this material is that often I am pressed 
towards the end and usually end a one-year course with Chapter 24, leaving 
most Galois theory for the next year. For those lecturers who would treat 
Galois theory in the first year, this raises the question of how one could push 
ahead in order to reach Chapters 25, 26, and 27. First of all, the addenda 
to Chapters 3 and 12 are not needed elsewhere in the text. Another two 
possible chapters that could be curtailed are Chapters 17 and 19. The 
second half of Chapter 17 contains a second proof of the basic theorem 
that Euclidean rings are Unique Factorization Domains (UFDs), by means 
of translating all the relevant concepts to ideals and then replacing the 
Euclidean degree function by considerations about ideals. Although one 
sees more generally that all principal ideal domains (PIDs) are UFDs, this 
generalization is nominal, since the motivating examples of PIDs are all 
Euclidean; indeed it is quite difficult to come up with an example of a UFD 
that is not Euclidean. Nevertheless, I would advocate a full presentation 
of this material for two reasons: It serves as an introduction to Noetherian 
rings and their techniques, one of the important advances in the early part 
of the twentieth century; and the methods are more elegant, belonging 
more intrinsically to the algebraic structure of rings, and indeed, ideals 
have taken on an even more important role in ring theory than has unique 
factorization. 

The material of Chapter 19 might perhaps be more vulnerable to the 
red pen, since its interest is partly historical. However, I feel that every so 
often the lecturer should step back and let the students see what can be 
reaped from the theories that they have labored so hard to master. For 
many years number theory was considered the epitome of human inquiry, 
attracting the attention of some of the greatest intellects in history, and 
the theories described in this book were motivated largely by the material 
in Chapter 19. 

Concerning the two appendices at the end of these notes, I feel uneasy 
relegating the transcendence of 7 to a corner unlikely to be reached in most 
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courses, but classroom experience indicates that this proof is perhaps the 
hardest of all for the students to digest, and perhaps the effort could be 
put to better purpose by explaining Galois theory more completely. 

Appendix B is a luxury, but it ties up several loose ends; and it intrigues 
me to see how far one can get in noncommutative algebra (including most 
of the Skolem-Noether theorem) simply by pushing forward some of the 
basic techniques of elementary abstract algebra. 

Although several of the exercises are more or less routine applications 
of the theorems, most exercises are extensions of the text, which usually 
require substantially more time to solve. A few exercises are intended to 
lead the reader to anticipate an upcoming topic; these exercises are usually 
difficult where they are presented, but become much easier later in the 
course. 

Of course the traditional role of exercises in a course is to provide more- 
or-less routine applications of the main results, for the student’s edification 
and also as possible material for examinations. These are provided as the 
Review Exercises at the end of the book. 

A note about proofs: In an elementary course one takes considerable 
care to find the “best” proof. The usual criterion in a mathematics book is 
the length of the proof (the shorter the better). However, several standard 
proofs involve sleight of hand, pulling some computation out of the thin 
air, so, in an effort to explain the underlying ideas, I have turned to the 
criterion, “Which proof is easiest to remember?” For the reader’s edification 
and amusement, several of the magic proofs have been put into the exercises. 

One dilemma in a course in algebra is deciding where to introduce ma- 
trices. Although various subsets of matrices provide some of the most 
important examples of groups, the full set of n x n matrices over a field has 
the structure of a “ring,” which usually is not defined until after groups 
have been studied for several months. (Indeed, we shall define a group in 
the first chapter.) Thus, strictly speaking, groups arising from matrices 
should not be introduced until far into the course; however, I did not want 
to wait to bring in such an important example. As a compromise, I have 
assumed that the reader is familiar with the definition and basic properties 
of matrices. 

I would like to express my gratitude to my colleagues Steve Shnider and 
Shalom Feigelstock for suggestions on improving the exposition, to Boaz 
Saban and Miriam Rosset for spotting errors in the draft version, and to my 
helpmate Rachel Rowen for sharing her expertise in computer typesetting. 
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PREREQUISITES 


Mathematics evolves around the study of various sets, so let us review 
briefly some foundations about sets. We write A C B to denote that 
the set A is a subset of B, i.e., every element of A is contained in B; if, 
furthermore, A # B, we may write A C B. The empty set is denoted as 0. 
A set S can be described either by a list of its elements or by means of 
a larger set. For example, perhaps the most fundamental set in algebra 
is N = {0,1,2,...} (the set of natural numbers); the set of even natural 
numbers can be designated either as {0,2,4,...} or {n € N:n is even} or 
simply as {2n : n € N} (which will be condensed even further to 2N). 

Given an arbitrary set S and A,B C S, we write ANB for {s € S: 
s € Aand s € B}; we say A and B are disjoint if AN B = 0. Likewise we 
write AU B for {s € S: s € Aor s € B}. One can think of AN B as the 
largest set contained in both A and B, in the sense that any set contained 
in both A and B must also be contained in AN B. Similarly AU B is the 
smallest subset of S containing both A and B, in the sense that any subset 
of S containing both A and B must contain AUB. We shall use “the largest” 
and “the smallest” in this sense, throughout the text. (On the other hand, 
“maximal” is used in the slightly weaker sense that there is nothing larger. 
For example, if we are given the subsets A; = {0, 1, 3}, A2 = {0, 2, 3}, 
and A; = {0, 3} of N, we would say A; and A2 are maximal among these 
three subsets, although none of these is the largest.) 

U and N satisfy the familiar axioms of associativity and distributivity for 
unions and intersections of subsets A, B,C of a set S: 
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AU(BUC) =(AUB)UC; 
AN(BNC) =(ANB)NC; 
AU(BNC) =(AUB)N(AUC) 
AN(BUC) =(ANB)U(ANC). 


Similarly, given an arbitrary collection of subsets A; : i € I of A, we define 
their intersection Ney A; or union Ujes Aj. 

A function f from a set A to a set B is denoted as f:A — B, and 
is also called a map. (We stipulate that the image f(a) is defined for 
each a in A.) Maps between sets turn out to be as important as the sets 
themselves. Given a map f: A - B we write f(A) for {f(a):a€ A} C B; 
we say f is onto if f(A) = B. On the other hand, we say f is one-to-one, 
written 1:1, if for any elements a, # az in A we have f(a,) # f(az2) in B. 
(It is usually easier to verify the equivalent formulation that f(a,) = f(a2) 
implies aj = a2.) A map which is 1:1 and onto is called a 1:1 correspondence, 
or bijection. 

Given a map f:A > B and b € B, we define the inverse image f=} 
as {a € A: f(a) = b}. Analogously for S C B, we define f~'(S) as 
{a € A: f(a) € S}. Of course f~! need not be a map, but if f is a bijection, 
then f~! is a map and indeed is also a bijection. For maps f: A > B and 
g: B + C, the composite h = go f: A > C is defined by h(a) = g(f(a)). 

The Cartesian product A x B of sets A,B is the set of ordered pairs 
{(a,6):a€ A, b€ B}. A binary relation ~ on a set A is defined formally 
as a subset R of A x A; usually we simply write a ~ b if (a,b) € R. For 
example the relation “a < b” is defined on N as {(0,1), (0,2), (1,2),...}; 
the relation “equality” in A is defined as {(a,a) : a € A}. Generalizing 
“equality,” one says a relation ~ is an equivalence if it satisfies the following 
properties: 

1. (“reflexivity”) @ ~ a for all a in A; 

2. (“symmetry”) a ~ b implies b ~ a; 

3. (“transitivity”) a ~ b and b~c imply a~c. 
For example parallelism of lines in the Euclidean plane is an equivalence 
relation (provided one says a line is parallel to itself.) 

Given an equivalence ~ on A, we define the “equivalence class” [s] of any 
element s of A to be {a € A: a~ s}. Note that any element belongs to its 
own equivalence class, by reflexivity, so A is the union of various equivalence 
classes. On the other hand, two equivalence classes either coincide or are 
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disjoint; hence, A is the disjoint union of its equivalence classes. The set of 
equivalence classes of A is denoted as A/ ~. 

Conversely, we define a “partition” of A to be a set of disjoint subsets 
whose union is A; any partition defines the equivalence relation defined by 
stipulating that a ~ 6 iff they lie in the same subset in the partition. (By 
“iff? we mean “if and only if.”) 

In algebra one often introduces new algebraic structures by means of 
equivalence classes. The reader may have encountered the following con- 
struction of the integers Z, from N : Define the equivalence ~ on N x N 
by 

(a,,@2) ~ (b1,b2) iff a, +b, =a2+b,. 


For example, (4,7) ~ (5,8). We define Z formally as the set N x N/ ~. 
Intuitively the equivalence class of (a;,a2) has been identified with the 
integer a, — az, since (a posteriori) 


(a1,42) ~ (b1,62) iff a, — a2 = by — bp. 


One can define algebraic operations on the equivalence classes, by means 
of representatives from the classes. For example one defines addition on Z 
by 
[(a1, a2)] + [(b1, b2)] = [(a1 + b1, a2 + 62). 


The equality -3+4 = 1 can be expressed as [(1,4)]+([(6, 2)] = [(7,6)]. This 
poses a new difficulty: one has to show that the outcome is independent 
of the particular representatives we chose in the equivalence classes. For 
example, if we used (2,5) instead of (1,4) we would wind up with (8,7) 
instead of (7,6), which is all right since they are equivalent. This condition 
is called “well-defined” and is one of the nuisances that we must contend 
with in many constructions. 

Another example of construction by means of conjugate classes is given 
following Definition 2, below. 

We shall assume such familiar properties of N as the unique factorization 
of any whole number > 1 into primes. In fact this follows from the theory 
to be developed in Chapter 16, but it is convenient to use these properties 
earlier, to develop examples in group theory. The reader is assumed to 
be familiar with the method of proof by mathematical induction, used in 
proving the bulk of our theorems, cf. Exercise 1. 

One fundamental property of N is that every nonempty subset S of N 
contains a unique smallest element. This can be proved by mathematical 
induction, and actually provides an alternate version of induction that will 
be used quite frequently in the theory of finite groups. 
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Much of mathematics involves the sets Q (the rational numbers), R (the 
real numbers), and C (the complex numbers), so we are led to study the 
properties these sets have in common. One can check easily that in each of 
these sets the following axioms hold with respect to addition and multipli- 
cation, for any elements a, b, and c: 

(Fl) (a+b) +c=a+(b+c) 
(F2) a+0=0+a=a 

(F3) a+b=b+a 

(F4) a+ (—a) = (-—a)+a=0 
(F5) (ab)c = a(bc) 

(F6) al=la=a 

(F7) ab = ba 

(F8) at = 4a =1 whenever a #0 
(F9) a(b+c) =ab+ac 

(F9') (b+c)a = ba+ca 

(Of course (F9’) is superfluous here in view of (F7).) We are now in 
position to cross the threshold of abstract algebra — why not define an 
abstract entity satisfying (F1) through (F9)? Then we could examine its 
properties and apply the ensuing results to Q,R, and C, thereby saving 
ourselves the trouble of studying each set separately. In order to do this we 
need more than just the set; we need to define “operations” corresponding 
to addition and multiplication. Accordingly, one defines a binary operation 
onaset S to beamap SxS > S, i.e., it takes an ordered pair of elements 
of S and assigns an element of S as the answer. For example, the binary 
operation + on Q takes (4, —7.5) to their sum, —3.5. 

Please note that each binary operation is assumed to be defined for 
every pair of elements (s;,s2) of S x S; in some treatments this property 
is isolated as a separate axiom (called closure), but here it is subsumed in 
the definition. 


Definition 1. A field is a set F, together with binary operations (denoted 
as + and ~° and called addition and multiplication) and designated elements 
0,1 € F, such that: 
(i) properties (F1),(F2),(F3),(F5),(F6),(F7), and (F9) hold for all a, b,c 
in F; 
(ii) For any element a in F there is a unique element denoted (—a) 
satisfying (F4); 
(iii) For any a # 0 in F there is a unique element denoted i, or aW}, 
satisfying (F8). 


Having defined a field we might look for more examples. The advantage 
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of arranging the properties as in Definition 1 is that the properties in (i) 
pass at once to subsets. On the other hand, Z, with the usual addition 
and multiplication, satisfies (i) and (ii) but not (iii), since 4 is not in Z; 
N satisfies (i) but neither (ii) nor (iii). Nevertheless, one can show that Q as 
a set is in 1:1 correspondence with Z (as well as with N). This points to 
the principle that operations accompanying a set are quintessential to the 
algebraic theory; the aggregate of set, operations, and designated elements, 
is called the (algebraic) structure. Here is an example that is very important 
in number theory, as we shall see. 


Definition 2. Given m > 1, define Z,, = {0,1,...,m— 1}, provided with 
the addition and multiplication of “clock arithmetic,” i.e., addition and 
multiplication taken modulo m, also written (mod m). 


To understand Z,, more fully, we need a formal description. Define 
mZ = {mn:n € Z} = {...,-m,0,m,2m,...}, the multiples of m. We 
define the equivalence a = b (mod m) (for a,b € Z), read “a is congruent 
to b modulo m,” iff a—b € mZ. Let us write [a] for the equivalence 
class containing a; thus [a] = {a + m] = [a—m] =... . In this way 
Z is partitioned into m equivalence classes [0], [1],...,[m — 1]. Now define 
addition and multiplication by [a] + [6] = [a+] and [a][b] = [ab]. One must 
check that this definition is well-defined, cf. Exercise 1, but then it is easy 
to transfer properties from Z to Z,,, merely by writing brackets wherever 
appropriate. 

In this manner we see Z,, satisfies (i) and (ii) of Definition 1 (taking 
—[a] to be [—a] = [m — a]}), but again (iii) may fail. Indeed the reader can 
easily check that [2]~! does not exist in Z4, leading us to the question: “For 
what m is Z,, a field?” We shall soon see (Corollary 1.13) that Z,, is a field 
iff m is a prime number; for example [2]! = [6] in Zi). 

Of course the field Z, (for p prime) has p elements, and the existence of 
finite fields will be important in our study of groups. Although Z, suffices 
for many applications, the reader should be apprised that there is a field 
having precisely n elements, if and only if n is a power of a prime number; 
this and other facts about finite fields will be proved much later, in Chapters 
24 and 26. 

As mentioned earlier, the reader is assumed to be familiar with matrices 
over R. Analogously, the set of n x n matrices with entries in an arbitrary 
field F also is endowed with the analogous matrix addition and matrix 
multiplication and is denoted as M,,(F). Let us define the n xn matric unit 
e,; of M,,(F) to be the matrix with 1 in the i— 7 position and 0 everywhere 
else. Then the following properties are satisfied: 
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(i) e11 +°+* + enn = 1; 
(ii) exjejx = Ci; 
(iii) CijCuv =0 if 7 # u. 
Any element of M,,(R) can be written uniquely in the form Li. jar Ties 
for r;; in R. Matric units are a significant aid for computing with matrices. 


Exercises 

1. Prove that addition in Z defined formally in the text actually is 
well-defined. Similarly, define multiplication in Z, and verify (i),(ii) 
of Definition 1, by induction. (This exercise becomes rather boring 
after a while, once one gets the hang of what is going on.) 

2. Prove in any field F that a,b 4 0 implies ab # 0. 

3. Define Q(./—1) to be {a+b /-=1 : a,b € Q}, which can be viewed as 
a subset of C. Prove Q(\/—1) is a field. (Hint: the only challenging 
part is the multiplicative inverse.) 

4. Define Q(V/2) to be {a+ bV2:a,be Q}, viewed as a subset of R. 
Prove this is a field, under the natural addition and multiplication: 


(a + bV2) + (c + dV2) = (a+c) + (b+d)V2 
(a + bV2)(c + dV2) = (ac + 2bd) + (ad + be) V2 


Although Q(/2) is clearly in 1:1 correspondence with Q(/—1), 
their algebraic structures as fields are different. Explain. 


PART I — GROUPS 


In this course, abstract algebra focuses on sets endowed with “algebraic 


structure,” 


and axioms describing how the elements behave with respect to 
the given operations. The operations of basic concern to us are multipli- 
cation and/or addition, in various contexts. In this spirit, we should start 


with some observations about Definition 0.1: 


1. Property (iii) (axiom (F8)) relies only on multiplication, not addi- 
tion; 

2. Properties (F5) through (F7) are the multiplicative analogs of 
(F1) through (F3), and (F8) is the analog of (F4), except that we 
delete {0}. 


Thus it makes sense to isolate properties (F1)-(F3) with an eye to in- 
clude (F4) shortly thereafter; then using the multiplicative analog (deleting 
{0}) we would understand (F5) through (F8). This will lead us first to the 
definition of “monoid” and then to “group.” Abstract groups appear in al- 
most every branch of mathematics and physics, as well as in other sciences 
and even in many aspects of day-to-day life (such as telling time). Our 
object is to develop enough of the theory of groups to enable us to answer 
basic questions concerning their structure and to familiarize the reader with 
certain groups that he or she is likely to encounter repeatedly in the fu- 
ture. Various interesting classes of groups are easier to study than groups 
in general. We shall obtain rather decisive theorems concerning Abelian 
groups; in Part III we shall need “solvable groups,” a useful generalization 
of Abelian groups that is studied in Chapter 12. 
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CHAPTER 1: MONOIDS AND GROUPS 


Definition 1. A monoid (M,-,e) is a set M with a binary operation - and a 
neutral element e (also called the identity) satisfying the following properties 
for all a,b,c in M: 

(M1) (associativity) (ab)c = a(be); 

(M2) ae =ca=a. 


(Note that, as is customary in multiplicative notation, we write ab in- 
stead of a-b. On the other hand, often - will be +, which is never suppressed 
in the notation.) 

Note that the neutral element is uniquely determined by the opera- 
tion; indeed if e and e’ are neutral elements then e' = e’e = e. Thus we 
usually delete “e” from the notation; for example (Z,+) can only mean 
(Z,+,0). We shall often delete the operation also, and write M for the 
monoid (M,-,e). 

We say elements a and b in M commute if ab = ba. The monoid M 
is commutative if ab = ba for all a,b in M. When the set M is finite, we 
say M is finite and write |M| for the number of elements of M, called the 
order of M. The particular operation plays a crucial role in the structure 
of the monoid. For example (Z2,-+,0) and (Zo, ,1) are finite monoids each 
of order 2, but whose structures are not analogous. (In the first case the 
neutral element is 0 and satisfies 0 = 1+ 1, but in the second case the 
neutral element 1 satisfies 1 4 0-0.) 


Note. Associativity enables us to write products without parentheses, with- 
out ambiguity, cf. Exercise 1. 


Let us now turn to the key notion, bringing in (F8). We say an element 
a of M is left invertible (resp. right invertible) if there is 6 in M such that 
ba = e (resp. ab = e); a is invertible if a is left and right invertible. 

Suppose a is invertible. Then there are elements 6,6’ such that ba = 
e = ab'. Let us see that 6 = b'. Indeed b = b(ab') = (ba)b’ = b'. Thus a has 
a unique element 6 such that ba = e = ab, which is called the inverse of a, 
denoted at. 


Definition 2. A group is a monoid in which every element is invertible. 
A commutative group is called Abelian, after the Norwegian mathe- 
matician Abel. 


Examples of Groups and Monoids 
Let us start with some basic examples. 
(1) If F is a field then (F,+) and (F \ {0},-) are Abelian groups, seen 
by axioms (F1) through (F4) and (F5) through (F8) respectively 


(given in the introduction). (See exercise 0.2 to clean up a sticky 
point.) 

(2) (Z,+) is an Abelian group, but (Z\ {0}, -) is a commutative monoid 
that is not a group. 

(3) (Zm,+) is an Abelian group of order m; (Zm,-) is a commutative 
monoid. 

(4) (See prerequisites.) For any field F and any n, write M,,(F) for the 
set of n xX n matrices with entries in an arbitrary field F, endowed 
with the usual matrix addition and multiplication. (M,(F),+) is 
an Abelian group. (M,,(F),-,1) is a monoid (but not a group) that 
is not commutative for n > 1, since the matrices ( 4) and (| 5 
do not commute. (These matrices are not invertible, since each 
has rank 1. Two invertible 2 x 2 matrices that do not commute 


are € i‘) and é ) .) One can define adjoints and determinants 


in the usual way; then A adj(A) = det A for any A in M,,(F), so 
A! = adj(A)/ det A exists whenever det A # 0. 

Aside. This argument also shows AB = 1 implies BA = 1, for 
any two matrices A, B, cf. Exercise 8. 

(5) Suppose S is a set. Then {functions f:S — S} form a monoid 
which we denote as Map(S, S$), whose operation is composition; the 
neutral element is the identity map 1g defined by 1s(s) = s for all 
sin S. 

(6) The trivial group {e} has multiplication given by ee =e . 


Note. Although the operation of a group often is +, it is customary to 
choose multiplicative notation when studying groups in general, since “++” 
biases us toward commutativity. 

One can extract a group from an arbitrary monoid. To understand 
this procedure let us first examine invertibility. 


1 1 


Remark 3. Suppose a,b are invertible. From the equation a” *a = e = aa 
we also see a = (a~')7!. Furthermore abb~1a~1 = e = b-1a7~1ab, implying 


(ab)~t = b-tatt. 
Given a monoid M, write Unit(M) for {invertible elements of M}. 
PROPOSITION 4. Unit(M) is a group. 


Proof. Associativity in Unit() follows at once from associativity in M. 
But e € Unit(M), so Remark 3 implies Unit(M) is closed under multiplica- 
tion and is thus a monoid. Moreover if a € Unit(M), then a~! €Unit(M) 
so Unit(M) is a group. O 


Example 5. Applying Proposition 4 to Examples (1) through (5) yield the 


following useful groups: 
(1) Unit(F,-) = F \ {0} for any field F. 
(2) Unit(Z,-) = {£1}. 
(3) Unit(Zm,-) = {a:1<a@< m, and a is invertible mod m}. This 


— 


is called the Euler group, denoted Euler(m). For example Euler(6) 
= {1,5}; Euler(7) = {1,2,3,4,5,6}, and Euler(8) = {1,3,5,7}. The 
order of Euler(m) is called the Euler number p(m). Thus y(3) = 
y(4) = (6) = 2, and y(5) = v(8) = 4. 

Unit(M,,(F),-) is the group of regular n x n matrices over F, called 
the general linear group and denoted GL(n, F). 

Let us pause for a moment, to assert that GL(n, F) is per- 
haps the most important group in mathematics, since it can be 
interpreted as the group of invertible linear transformations of an 
n-dimensional vector space over the field F; as such, it has funda- 
mental significance in geometry and in physics. (This identification 
also enables us to prove various properties of GL(n, F), cf. Exercise 


6.) 


Other “geometrical” groups can be defined similarly in terms of various 
kinds of linear transformations, cf. Chapter 2 and Exercises 12.9ff, 12.15ff. 
Nevertheless, the focal role in these notes is played by the next example. 


(5) Unit(Map(S,S)) is denoted as A(S), the 1:1 onto maps from S 


to S. In the special case S = {1, ... ,n} we denote A(S) as S,, the 
group of permutations of n symbols. S;,, often called the symmetric 
group, plays a special role in finite group theory and will be used 
throughout as our main example. The reader is urged to trace its 
development via the index. 


When Is a Monoid a Group? 


We want to explore the fundamental question of when a given monoid is 


already a group. One basic property of groups will become focal. 


Remark 6. If ab = ac with a left invertible, then 6 = c. (Indeed, multiply 
by the left inverse of a.) 


Accordingly we call a monoid (left) cancellative if it satisfies the prop- 


erty 


ab = ac implies b =c 


for any elements a,b,c. An example of a cancellative monoid which is not 
a group is (Z\ {0},-). On the other hand we have 


THEOREM 7. Any finite cancellative monoid M is a group. 
Before proving Theorem 7, let us accumulate some facts. 


LEMMA 8. If every element of a monoid M is right invertible, then M is a 
group. 


Proof. We need to show any element a of M is invertible. By hypothesis 
there is b such that ab = e; likewise there is c such that be = e. But then 6 
is invertible, so a = 671, as noted after Definition 2, implying 6 = a7'. O 


Fact 9. Suppose S is a finite set. A function f:S — S is 1:1 iff f is onto. 


This fact is called the “pigeonhole principle”; for if a letter carrier is 
to distribute 17 letters into 17 boxes, clearly each box receives a letter iff 
no box receives at least two letters. Of course the pigeonhole principle fails 
for infinite sets, as illustrated by the map f:N > N given by f(n) =n+1 
for all n in N (which is 1:1 but not onto). The pigeonhole principle relates 
to monoids and cancellation via the following key observation. 


PROPOSITION 10. Suppose M is a monoid. Given s € M define the left 
multiplication map t,:M — M by ¢,(a) = sa for allain M. 


(i) &, is onto iff s is right invertible; 


(ii) ¢, is 1:1 iff sb # sc for all b,c in M. 


Proof. (i) (=) e = £,(s') for some s’ in M; thus ss’ =e. 
(<=) If ss’ =e then a = ss'a = £,(s'a) for any a in M. 
(ii) Self-evident. O 


Proof of Theorem 7. By Proposition 10(ii) we see for each s in M that @, is 
1:1 and thus onto by the pigeonhole principle. Hence each s in M is right 
invertible, by Proposition 10(i), so, by Lemma 8, M is a group. O 


Remark 11. The proof of Theorem 7 shows that for any element g of a 
group G the left multiplication map £4:G — G is 1:1 and onto. We shall 
return to this fact later. 


Let us apply theorem 7 to divisibility and the Euler group. 


Remark 12. Writing gcd(a,b) to denote the greatest common divisor of a 
and 6, we claim Euler(m) = {a € N: 1 < @ < m and ged(a,m) = 1}. 
Indeed, using unique factorization in N, it is easy to identify the right-hand 
side with a cancellative monoid contained in Z,,. (If gcd(a,,m) = 1 and 
ecd(az,m) = 1, then gcd(a,a2,m) = 1; also if ab = ac in Zp, then ab = ac 
(mod m), implying m divides ab — ac = a(b—c), so m divides b—c.) Hence 
the right-hand side is a group so by definition is contained in Euler(m). 


On the other hand any invertible element a of (Z»,-) is relatively 
T 
implying by remark 6 that 4 = 0 (mod m), so d= 1. Thus we have proved 


prime to m, for if d € N divides both a and m, then a0 = 0 = 4m =a 


equality. 0 


COROLLARY 13. A number is invertible mod m iff it is relatively prime 
to m. In particular Euler(p) = {1,...,p — 1} iff p is prime. 


CoROLLARY 14. Z, is a field, for any prime number p. 


CoroLiary 15. {7, 2 +4, ..., 2+ (p— 1)t} are all distinct (mod p), for 
p prime, whenever t # 0 (mod p). 


Proof. One may assume 7 = 0; but then the first term is 0, and the others 
{t, ..., (p—1)t} are distinct and nonzero (mod p), seen by canceling ¢ in 


Euler(p). O 
CoROLLaryY 16. Ifgcd(m,n) = 1, then there are a,b in Z with am+bn = 1. 


Proof. Let 6 =n7! in Euler(m). Then m divides 1 — bn, so 1 — bn = am 


for some ain Z.O 


Exercises 


1. Define a1 a.a3 = (a@1@2)a3 and, continuing in this way, define 
Q1...G6n = (A ...An-1) Gn. 


Prove that (@1...@m)(Gm41-.-@n) = @1...@,, for any n and any 
m <n. (Hint: Induction on m.) 

2. Define A+ B= {a+b6:a€ A, 6 € B} for subsets A,B of Z. 
Compute 3Z+ 4Z,6Z+10Z, and in general aZ-+ bZ. 

3. Show if a, = by and ay = by (mod m) then a; + a2 = by + by and 
a1 = bybe. Conclude that addition and multiplication in Z,, (as 
defined via equivalence classes) is well-defined. 

4. If a? =1 (mod p) for p prime, then a = +1 (mod p). (Hint: Show 
that p divides a? — 1 = (a+1)(a—1).) What happens for p not 
prime, e.g.,p =15? p=21? 

5. Write down the Euler groups Euler(n) for all n < 10. For which 
values of n does Euler(n) have order 2? Show that these groups are 
the “same” in some sense (to be made precise in Chapter 4). 

6. Suppose F is a finite field of q elements (for example, if q is prime 
one could have F = Z,). Then the group GL(n, F) has order 


OSG Sa).2c(g? =o" *) 
=q"'™ D/P (g? —1)(q? 1 =1)...(q— 1). 


10. 


T. 


(Hint: One can define an arbitrary invertible linear transformation 
by sending a given base to any other base, so |GL(n, F)| is the 
number of different possible bases of the vector space F'” over F 
(where the order of the base elements is significant). The first vector 
in the base can be any nonzero vector and thus chosen (g” —1) ways; 


mn 


the next vector can be chosen (g” — q) ways, and so forth.) 

How many concrete examples of finite groups can you produce at 
this stage? What is the non-abelian group of smallest order? 
Consider the property ab = 1 implies ba = 1. Show this property 
holds in M,,(F) (F a field) and in any finite monoid. However it 
fails in Map(S,5}) when S is an infinite set. (Hint: Define f: N > N 
by f(n) =n+1.) 

A semigroup is a set with a binary operation satisfying M1 of def- 
inition 1, but not necessarily M2. Any semigroup S can be made 
into a monoid, simply by adjoining a formal identity e and defining 
ea = ae = a for all a in S. Nevertheless, semigroups (without 1) 
arise in several contexts, where the elements are not necessarily in- 
vertible. (For example, let S be the set of functions R > R that 
are continuous in the neighborhood of a point.) 

Inspired by the example in Exercise 9, call a semigroup S regular if 
for each a in S there is b in S' such that aba = a and bab = b; if 6 is 
unique it is called the inverse of a. An inverse semigroup is a regular 
semigroup for which every element has an inverse. Prove the analog 
of Remark 3 for inverse semigroups. Inverse semigroups have com- 
manded considerable attention in research, and have applications 
in computer science. 

An inverse semigroup that is cancellable is a group. 


CHAPTER 2. How To DIVIDE: LAGRANGE’S THEOREM, COSETS, 
AND AN APPLICATION TO NUMBER THEORY 


Our goal today is to introduce subgroups and show that the order of a 
subgroup divides the order of the group. This raises a basic question in 
arithmetic: Given two positive numbers m and n, how do we verify that m 
divides n (written m|n)? Two methods are already available from elemen- 
tary school: 


(1) The Euclidean Algorithm says that n = gm-+r for a suitable quotient 
gq > 0 and remainder r with 0 < r < m, so one need merely check 
r=0. 

(2) Possibly even more fundamentally, m|n iff one can partition a set 
of n elements into subsets each having m elements. 


We shall use each of these methods here, but first let us turn to the 
key ingredient of our present discussion. We have defined various algebraic 
structures thus far — fields, groups, monoids. When a given subset acquires 
the same structure from the larger set, we append to it the prefix “sub.” 
For example: 


DEFINITION 1. A submonoid of a monoid (M,-,e) is a subset N which 
becomes a monoid under the same operation - and neutral clement e. 
A subgroup of a monoid is a submonoid which is a group. 


Sometimes a group can be described most easily as a subgroup of 
another group. For example, one famous subgroup of (C \ {0},-) is the 
“unit circle,” defined as {complex numbers having absolute value 1}. 


Remark 2. To show that the subset N is a submonoid of M, it is enough 
to check that e € N and N is closed under - ; axioms (M1) and (M2) of 
Definition 1.1 automatically hold in N (since they hold in the larger set M). 


We shall be particularly interested in subgroups. One sees easily that 
Unit(M) is the largest subgroup of M, in the sense that it contains every 
subgroup of M. Thus M and Unit(M) have the same subgroups, so our 
attention focuses naturally on subgroups of groups. 

A submonoid of a group need not be a group; e.g., (N,+) is a sub- 
monoid of (Z,+). Nevertheless a submonoid of a group is cancellative by 
Remark 1.6; consequently Theorem 1.7 yields 


Remark 3. Any finite submonoid of a group is a subgroup. 


We write H < G to denote that H is a subgroup of a group G. Obvi- 
ously {e} and G are subgroups of G. A subgroup H of G is proper when 
HA #G; in this case we write H < G. Much of the theory of groups relies 
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on studying subgroups, so we would like an easy criterion to verify whether 
a given subset is a group. 


PROPOSITION 4. Suppose G is a group and H is a nonempty subset. Then 
A < Gif either of the following two criteria is satisfied: 

(i) hihy! € H for any hy,he in H; 

(ii) H is finite and is closed under the group operation. 


Proof. (i) Take any hin H. Then e = hh-! € H. Nowh-t=eh"1€ A 
for any hin H. Finally, hyhy = hi(h,')~1 € H. We have just proved H is 
a submonoid closed under inverses and so is a subgroup. 

(ii) In view of Remark 3 we need merely check that e € H. Take 
any h in H; noting {h* : i > 0} is finite, we must have h’ = h/ for some 
i>j. Butthene=h*JE€H.O 


Any group G has the trivial subgroups {e} and G itself. All other 
subgroups (if they exist) are called nontrivial. 


Example 4': Some examples of nontrivial subgroups. 

(i) Suppose FC F are fields. Then F'\ {0} is a subgroup of (#\{0},-). 
On the other hand, (# \ {0},-) has other subgroups. One famous subgroup 
of (C\ {0}, -) isthe “unit circle,” the set of complex numbers having absolute 
value 1. 

(ii) The nontrivial subgroups of Euler(7) = {1,2,3,4,5,6} are {1,2, 4} 
and {1,6}. In general, {1, » — 1} is a subgroup of Buler(n). 

(ili) mZ = {mt :t © Z} < Z. We shall see shortly that these are the 
only subgroups of Z. 

(iv) If m|n then {[0], [m],[2m],...,[2 — m]} < (Zy,+). We shall see 
later that these are the only subgroups of Zp. 

(v) GL(n, F) has many important subgroups, some of which are given 
in example 3.15. Other subgroups of GL(n, F) include the subset of all 
invertible diagonal matrices, and the subset of upper triangular matrices 
having nonzero diagonal entries. 


The “smallest” subgroup containing a given invertible element a is 
called the cyclic subgroup generated by a and is rather easy to construct. 
Namely, define a® = e and a’ = a...a (where a occurs i times in the 
product) for any i > 0; define a~* = (a*)"!. Then {a' : i € Zhisa 
subgroup that is a subset of every subgroup containing a and thus is the 
cyclic subgroup generated by a. 

Since our ultimate concern is groups, we work now in a group G and 
write (a) for the cyclic subgroup generated by a € G. Note that if {a* : i> 
1} is finite, then this is already a group by Proposition 4, so equals (a). In 


particular a” = e for some n > 1; the smallest such n is called the order 
of a, denoted o(a). Note that o(a) = 1 iffa =e. 


PRoposITION 5. If o(a) =n, then (a) = {e,a,...,a"- 1}, and a” =e. 


Proof. It is easy to see that the finite set {e,a,...,a"~1} is closed under 
multiplication (since a” = e) and thus is a group containing a. It remains 
to show that e,a,...,a"~! are distinct. But if a’ =a! forO<i<j<n, 


then a/—* = e, contrary to the minimality of n. 0 
Coro.iary 5!. (for G finite) (a) = G iff o(a) = |G}. 


If o(a) divides a number m then a” = e, indeed, writing m = o(a)t we 


* =e, Let us examine the converse. 


see a™ = a(t — 
PROPOSITION 6. If a™ = e, then o(a) divides m. 


Proof. Let n = o(a). Using the Euclidean algorithm, we divide n into m 
and check that the remainder is 0. More precisely we write m = nq+r 
where 0<r <n. Then 


a” =a™ "4=a™(a") 1=ce=e. 


But r <n = o(a), contrary to definition, unless r = 0. 0 


Coro.iary. If m divides o(a) then o(a™) = 4. 


™m 


Here is another instance of the same argument. 


Example 6': Every subgroup of Z is of the form mZ for suitable m > 0. 
Indeed if 0 #4 H < Z take m > 0 minimal in H. Clearly mZ < H, and we 
claim mZ = H. Indeed take any h € H. Then h = mq+r7r for suitable 
0<r<m. But r=h-— meq € A since mq € mZ C A. By minimality of 
m we conclude r = 0, «.e.,h =mqe€mZ. 0 


Note that nZ < mZ iff m|n; in particular nZ = mZ iff n = +m. 


Cosets 
Usually it is rather difficult to determine precisely the subgroups of a given 
group G, so we would like at least to describe certain properties of subgroups 
of G. The most basic property perhaps is the order, at least when G is finite. 
Note that any subgroup H of G contains {e}. Thus |H| = 1 iff H = fe}. 
On the other hand |H| = |G| iff H = G. 

Thus |H| provides information about the subgroup H itself, and our 
goal is to see how || is related to |G|. We shall prove the startling theorem, 
due to Lagrange, that |H]| divides |G], for any subgroup H of G. This time 


the idea is to partition G into subsets each having order ||. Obviously H 
itself is such a subset, and we look now for others. 


Definition 7. A (right) coset of H in G is a set Hg = {hg : h © H} where 
g € Gis fixed. 


Remark 7'. Right multiplication by g provides a 1:1 correspondence from 
H to Hg, so |Hg| = |A|. 


It remains to show that the cosets of H comprise a partition of G. 
(This could be done at once, using equivalence classes, cf. Exercise 1, but 
at this stage I prefer a more explicit approach.) Clearly for any g € G, we 
have g = eg € Hq. It follows at once that G = Ugeg Hg, so it remains for 
us to show that distinct cosets are disjoint. 


Remark 8. The following are equivalent for g,g' in G and H < G: 
(i) Hg' C Hg: 
(ii) g' € Hg: 
(iii) g'g7! EH. 
(Indeed, one verifies (7) © (27) & (277) directly.) 
LEMMA 9. If H < G and Hq, C Hq, then Hq, = Haq. 


Proof. Write g, = hgz. Then gz = h7tq,, so Hg. C Hai, by Remark 8, 
implying Hg, = Hq as desired. O 


Remark 9'. Ha = Hb iff ab~' € H, by Remark 8 and Lemma 9. In partic- 
ular, Ha = He iffa € A. 


Proposition 10. If Hq, 4 Hoo, then Hg, Hg, =. Thus the cosets of 
AT comprise a partition of G. 


Proof. We show the contrapositive. Assume g' € Hg, Hage; then Hg! C 
Ha, Hg, so by Lemma 9, Hg, = Hg! = Haq, as desired. 0 


THEOREM 11 (LAGRANGE’S THEOREM). If H is a subgroup of a finite 
group G, then |H| divides |G|; in fact 1¢1 is the number of cosets of H 


G 
; |A| 
in G. 


Proof. We have just seen that G is a disjoint union of its cosets, each of 
which has the same number of elements as H. 0 


Motivated by this proof, we define the index of a subgroup H in G, 
written [G : H], to be the number of (right) cosets of H in G. Note this 
could make sense even for G infinite; for example [Z : 2Z] = 2. 


Remark 11'. When G is finite, Lagrange’s theorem says that [G : H] = Tae 


On the other hand, we could have developed the same theory by using left 
cosets gH instead of right cosets Hg. Hence, by symmetry, Tal is also the 
number of left cosets of H in G. This idea is illustrated later, in the proof 
of Proposition 5.9 below. See Exercises 10.11ff for a discussion of what 


happens when we combine left and right cosets. 


Example 12: Let G = (Zg,+) and 
H = {[a] € G: ais even} = {[0], [2], [4]. [6]}. 


FT has two cosets: H itself and H+([1] = {[1]. [3], [5], [7]}. Note that |G| = 8 
and |H| = 4. 


The converse to Lagrange’s theorem is false; there is a group G of 
order 12 that fails to have any subgroup of order 6, cf. after Remark 5.23. 
Returning to cyclic subgroups we recall o(g) = |(g)|, and thus have 


CoROLLARY 13. o0(g) divides |G], for every element g of G. 


CoROLLARY 14. If |G| is a prime number p, then G has no nontrivial 
subgroups. In particular, if g € G\ {e}, then G = (4g). 


Proof. Any subgroup # {e} has some order m > 1 dividing p, implying 
m=p.0 


Fermat’s Little Theorem 

Let us see how these ideas apply to a celebrated (although now easy) the- 
orem of Pierre Fermat. We shall have occasion to refer several times to 
Fermat’s work in number theory, as a sounding board for the theory devel- 
oped here. Although a jurist by profession, Fermat was one of the great 
mathematicians of all time, who loved to tantalize his colleagues by discov- 
ering deep results in number theory and challenging other mathematicians 
to reproduce them. The result discussed here is perhaps the most basic 
theorem concerning prime numbers. 


THEOREM 15. (Fermat’s Little Theorem) If p is a prime number and 
(a.p) = 1, then a?~! =1 (mod p). 


Proof. Recall Euler(p) = {1,...,p — 1} is a group under multiplication and 
contains a, so it suffices to prove o(a)|p — 1. But this is true by Corollary 13, 
since |Euler(p)|=p—1. 0 


Fermat’s Little Theorem also helps test whether a given large number is 
prime, since it turns out that a?~! is usually not congruent to 1 mod p when 


pis not prime. Thus, given a large number p, if we compute a?~+ for twenty 


random values of a < p and always get 1 (mod p), we can be virtually certain 


that p is prime. We shall touch on a related idea in Exercise 15. Although 


this probabilistic approach might seem inappropriate for an “exact” science 


such as mathematics, primality testing for large numbers relies on the use 


of computers, so it is fair to use a result whose uncertainty is less than the 


chance of computer error. 


Exercises 


ae 


10. 


11. 


An alternate way to introduce cosets. Given H < G, define the 
equivalence a ~ b in G if ab~! € H. The equivalence classes are 
precisely the cosets of H, so some of the arguments in the text 
could be streamlined. 

A group cannot be the union of two proper subgroups A and B. 
(Hint: Ifa ¢ B and b ¢ A, where is ab?) 

Give an example of a group that is the union of three proper sub- 
groups. 

If G is a finite Abelian group, then the product in G of the elements 
of G equals the product of those elements having order 2. (Hint: 
pair off each element with its inverse.) 

If F is a field with +1 4 —1, then —1 is the unique element of 
(F \ {0},-) having order 2. 

(Wilson’s theorem) p is prime iff (p — 1)! = —1 (mod p). (Hint: 
Exercises 4 and 5 applied to the field Z,. A more sophisticated 
proof is given in Exercise 3.5.) 


Euler’s number Recall that y(m) is defined as the order of 
Euler(m). There are two formulas to compute y(m), the first indi- 
cated in Exercise10 (and given explicitly in exercise 6.7), and the 
second given in Exercise 13. 

(Euler’s theorem) Prove a? = 1 (mod m), for any relatively 
prime numbers a and m. 

If m = pip for py, p2 prime, show p(m) =m — a = as +1. (Hint: 
We exclude multiples of py and of pz, but then have excluded m 
itself twice.) 

For p prime, y(p*) = pt — p* 1 = p'(1— 5). 

Generalizing Exercise 8, show if m = pip) for p1,p2 prime numbers, 
then y(m) = m(1— al rs mae Can this be generalized to arbitrary 
numbers? (See Exercise 6.7) 

Vain O(a) = n. (Hint: Define f:{1,2,...,.n} 4 Ug,Euler(4) as 
follows: Given k <n, let d= (k,n), and send k to § in Euler(4).) 


12; 


13. 


14. 


15. 


16. 


Define the Mobius function 


1 n=1, 
p(n) = 4 0 n has a square factor > 1, 


(-1)' nis a product of ¢ distinct primes. 


Show pi(myn2) = p(n) p(n2) if n1,n2 are relatively prime. Prove 
the Mébius inversion formula for a function f:N > Z: If g(n) 
is defined by g(n) = Vian f(a), then f(r) = Yai, HCG)g(d). In 
particular, }7 4), (a) = 0 for all n > 1. (Hint: For the last assertion 
take f(1) =1 and f(n) =0 for n > 1.) 

Using Exercises 11 and 12 show y(n) = 374), (7d, for every pos- 
itive number n. 

Compute y(100). What are the last two digits of 131? (Hint: 
Translate to Euler(100).) 

Suppose p1,p2 are odd primes such that gcd(p; — 1, ps — 1) = 2, 
and let m = pip2. If a7! = 1 (mod m), then a? = 1 (mod m):; 
hence there are only 4 solutions for a. (Hint: Show o(a) divides 
p, + p2 — 2, and thus divides (p, — 1)? and (pz — 1)?; hence o(a) 
divides 4.) 

Consider the rational number 7, m <n, and its decimal expansion. 
This expansion must be infinite, iff n contains a prime factor other 
than 2,5. Every expansion “repeats.” The period of the expansion is 
the number of digits in the minimal repeating pattern. For example, 
+ = .0714285714285... has period 6. Show that the period must 
divide n — 1. On the other hand, the periods of = and = are the 
same; conclude that the period must either be 1 or even. Consider 
these assertions for other bases, i.e., binary expansion and octal 
expansion, and compare the periods of the same fraction in binary 
and in octal expansion. 


CHAPTER 3. CAUCHY’S THEOREM: How TO SHOW A NUMBER 
IS GREATER THAN 1 


In Chapter 2 we saw that Lagrange’s theorem has the consequence that if 
g € G and o(g) = m, then m divides |G]. This raises the converse question: 
“If m divides |G| then does o(g) = m for suitable g in G?” Our goal will be 
to prove this for m prime; for m not prime the result is false, as evidenced 
by Euler(8) = {1,3,5,7}, which has order 4 although each of its elements 
has order 2. In proving Lagrange’s theorem we examined the process of 
division. In studying the converse we shall learn how to count. 


THEOREM 1 (CAUCHY’S THEOREM). Ifa prime number p divides |G], then 
G has an element of order p. 


There is an easy proof for G Abelian, given in Exercise 7.1. However, 
we proceed directly to the proof in general. The proof becomes rather easy 
for p = 2, as given in Exercise 1, so the reader is advised to try that first. 


Proof of Theorem 1. Let S = {g € G: g? =e}. Clearly e € S, so |S| > 1. 
In fact g € S iff o(g)|p, t-e., o(g) = 1 or p; since e€ is the only element of 
order 1 our goal is to show |S| > 2. This is not an easy task, and the trick 
is to show p divides |S]; then || > p > 2. 

Let T = {(91..--.4p) € GK +++ X Gi gy...gp =e}. We shall show that 
p divides |S|, by counting |7'| in two different ways. On the one hand for 
any g1..--,9p—1 in G we have a unique g, in G such that (g1,....¢p) € T: 
namely gp = (g1---Gp—1) -- Thus |7| equals the number of ways we can 
choose p—1 elements arbitrarily from G, i.e., |T| = |G|?~+. Since p divides 
|G|, we see p divides |T|. On the other hand (g,..,g) € T iff g? =e , iff 
g€S. Thus, letting T’ = {(g1,....9p) € T : at least two g; are distinct}, 
we see |T'| = |S|+|T"|. We shall prove p divides |7"|, which then implies 
p divides |T| — |T"| = |S], as desired. 

To prove p divides |T7"| it suffices to partition T’ into disjoint subsets 
each having p elements. In principle this is easy, for if (g1,....9p) € T’, 
then (gi, gi+1,---:9p:91-+--,gi-1) € T' for every 1 < i < p. (Indeed 
(91----,9p) € T’ implies € = gy...dp = (91---Gi-1)(Gi--Ip). 80 Gi Ip = 
(g1-.-gi-1) 1; multiplying the other way shows 4j...9p91..-9i-1 = €-) 

It remains to show that the set {(gigi¢1 ---9p91---gi-1) 1 1 <2 < p} 
actually contains p distinct elements, 1.e., that 


GiGit1---OpM---Gi-1, 1L<a<p 


are all distinct. This part of the argument has nothing at all to do with 
the structure of G, but still utilizes some of the ideas developed earlier. 
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Suppose on the contrary 


Oigit1-- “Inf -6- GQi-1 = GiGj+1 ae “Inf ote “Gj-1; 


with 1< i<j <p. Lett= 37-12. Then g = 9; = gi4t. Looking now 
at the (t+ 1) component of each side we see gj = gj41+ = gitar (subscripts 
modulo p). Continuing in this way we have 


Gi = Gitt = Jitat = °° = Dit(p-1)t- 
By Corollary 1.15, the indices {7, i+¢, ..., i+(p —1)t} are all distinct 
modulo p and thus traverse {1, 2, ..., p} (mod p); we have shown that 
1 = 9 =+++ =p. contrary to (g1,-.-,9») € T’. O 


This proof is an example par excellence of a “combinatorial proof,” 
one that uses an ingenious counting argument, often needing few prerequi- 
sites, in order to prove a deep result. Cauchy’s theorem usually is proved 
much later in the theory, only after a sufficient body of theorems has been 
obtained to erect a “structural proof.” It is quite convenient to have it 
available at this stage. 


Remark 1'. Actually we have proved the stronger result, that, for p prime, 
the number of elements g satisfying g? = e is divisible by p. It follows easily 
that the number of subgroups of order p in G is congruent to 1 (mod p), 
cf. exercise 2. This result holds for arbitrary powers of p and is due to 
Frobenius.The proof of Frobenius’ theorem is sketched in Exercise 11.12. 


Cauchy’s theorem has many important applications, but first we use 
it to define a number related to |G]. In what follows lcm denotes the least 
common multiple. 


The Exponent 


Definition 2. The exponent of a finite group G, denoted as exp(G), is 
Icm{o(g) : g € G}. 


PROPOSITION 3. exp(G) divides |G|. Conversely, every prime number di- 
viding |G| also divides exp(G). 


Proof. Lagrange’s theorem shows o(g) divides |G| for every g in G, so clearly 
exp(G) divides |G|. The second assertion is Cauchy’s theorem. 0 


Example 4. (i) If G= (g) then exp(G) = |G], since o(g) = |G]. 
(ii) If |G| = p' for p prime and exp(G) = |G], then G = (g) for some 


t-1 


g in G (for otherwise each element has order dividing p*~*, contradiction). 


The same argument shows a more general fact: If |G| is a prime power, 
then there is some g in G for which o(g) = exp(G). Later we shall obtain 
an analogous result for arbitrary Abelian groups, cf. Remark 7.12'.) 

(iii) Exp(Euler(8)) = 2, whereas |Euler(8)| = 4. 

(iv) If |G| is a product of distinct prime numbers then |G| = exp(G), 
by Proposition 3. In particular, exp(S3) = 6 = |S3|, although $3 is far from 
cyclic (in contrast to (ii)). 


PROPOSITION 5. If exp(G) = 2, then G is Abelian. 


Proof. For any a,b in G we have a7b? = e = (ab)? = abab; canceling a on 
the left and 6 on the right yields ab = ba. O 


This result cannot be generalized directly to exponent 3, since we shall 
sec for any odd prime p there are non-Abelian groups of order p* and 
exponent p. 


S,: Our Main Example 

To gain intuition we now begin a detailed investigation of S,,, which is 
carried through the next few sections. Recall an element of S,, called a 
permutation, is a 1:1 onto transformation m:{1,...,n} > {1,...,n}. Writ- 
ing wi for the image of i, we can rewrite 7 in terms of its action on each 


1 De pects n 
Wl m2... ani 
Ved es ae which we also denote as (1). By 


12.007 


: 123 123 123 
O°. = F 
convention mo denotes toa. Thus é ; ) ¢ ‘ : (3 : a 


element, 7.€., as 


The neutral permutation is ( 


The permutations in S3 are 


(1)= 123 123 123 123 123 123 
a Mees fay rch Wel Jy Anica a Bs Dae Ac NS a aa Ws a WA cas A er A 


One can see easily that in defining a € S;, there are n choices for 71, 
n—1 remaining choices for 72, etc., and so |S,| = n(n —1)--- =n! 

S3 claims the distinction of being the smallest non-Abelian group. In- 
deed, any group of prime order is cyclic, and any group of order 4 either 
has exponent 4 (and thus is cyclic by Example 4(ii)), or has exponent 2 
(and thus is Abelian). 


Subgroups of S, 
For any k <n, S, has a subgroup consisting of those permutations that fix 
k+1, kh+2,..., n, 7e., all permutations of the form 


1 2) cee he ReR TS an. on 
wl w2 ... we k+1 1... nie 


This subgroup can be identified with $;, which has order k!. Of course Sy, 
may have other subgroups. We consider small values of n. 
Since |.S2| = 2 is prime, the only subgroups of S2 are itself and (1). 
Since |.$3| = 6, the proper subgroups of Ss must have order 1,2, or 3, 
and thus are cyclic, by Corollary 2.14; explicitly they are 


((33)) =: 

(52) = {Css} } 
((2s)\=fay.(223)}. 

((228)) = fay, (228), (228) = ((228)), and 
(G20) - {o.Gih 


Cycles 

The notation introduced above is cumbersome and, furthermore, masks 
some important information such as the order of the permutation. Toward 
this end we introduce a special kind of permutation, with a more concise 
notation. 


Definition 6. A cycle of length t, written (i, 72 ... 4) for distinct 71,...,%, 
is the permutation a defined by mi, = 72, Wig = 123, ..., Wi¢_y = 14, THe = 
41, and mi =1 for all other z. A cycle of length 2 is called a transposition 


For example, each element of S3 is a cycle, and we could rewrite the 
above list respectively as {(1), (23), (12), (123), (132), (13)}. Three of 
these are transpositions ((12), (13), and (23)). 

This notation is very useful, although slightly ambiguous in several 
aspects: 


It suppresses “n”; indeed (12) can denote € s) in S», iC : a in S3, 
and so forth. 
(ay tg 22. U4) = (ty... Ue dy) = = (4 ty tg... ), so the notation 
is not quite unique even when n is given. The notation becomes 
unique if we choose 7; minimal, as is the custom. 
If w = (41 a2 ... it) then wi, = i3, 2%, = ig and so forth, until 
ni, = 11. Likewise mi; = iz for all k, so mt = (1). We conclude that o(7) 


= t; 1.e., the order of a cycle is its length. 

1234 
2143 
o(7) = 2. However, m = (12)(34) is a product of cycles, and we would like 


S4 contains the permutation 7 = ( ) which is not a cycle since 


to pursue this matter further. We say two cycles (2, ... i) and (j1 ..- ju) 
are disjoint if {11,...,44¢} N4i1,---sJuh =O. 


Remark 7. Any two disjoint cycles commute. (Indeed if m = (21 ... 74) 
ando = (j1 ..- ju), then mot = oni =7 for 2 Z fi1,...,4} U {f1,---. du}; 
TOW, = Ki, =ipz41 (subscripts mod t) = oiz4, = ori, for 1 <k < t, and 
likewise moj, = Wjk41 = Jk+1 = Ojk = OT), for 1 <k <u.) 


THEOREM 8. Every permutation is a product of disjoint cycles (and these 
commute with each other, by Remark 7). 


No proof. The proof would be to write each cycle as it comes, but we leave 
the details to the reader, who might find the following example instructive: 


123 4 5 6 7 8 9 10 11 12 13 14 15)\ _ 
3 6 7 10 14 12 13 15 4 11 5 8 12 977 


(1.3.7 26)(4 10)(5 14 12)(8 13)(9 15)(11) 
Of course the cycle (11) at the end could be deleted. 


Remark 9. If o1,..,07, are disjoint cycles, then 
0(01...0,) =Ilem/(length(c,),...,length(o,,)). 


(Indeed (01 ...0,)” =o7”...07", from which the assertion is clear.) 


The Product of Two Subgroups 
Let us turn now to a question whose answer relies heavily on cosets and 
Lagrange’s theorem. Given H, kK < G define 


HK ={hk:h€H, ke K}. 


The question is, “Is HK necessarily a subgroup of G?” 

PROPOSITION 10. If H,K <G with HK = Kd, then HK <G. 

Proof. Suppose h; € H and k; € K; using the criterion of Proposition 2.4(1) 
we note (hiki)(hok2) + =hikyky thy’ € HKH = HAK CHKO. 
COROLLARY 11. IfG is Abelian and H, K < G, then HK <G. 


(The idea of rearranging an expression so that like elements appear 
together, recurs frequently in the sequel.) On the other hand, if Hk # 
KH, then the proof of Proposition 10 fails, and in fact HK will not be a 
subgroup; cf. Exercise 3. 


Example 12. (i) G = S4, H = ((12)), K = ((34)). Since (12)(34) = 
(34)(12), we see HK = KF is a subgroup of $4. 


(ii) G= S 3. Take distinct subgroups H, K of order 2, say H = ((23)) 
and K = ((12)). By direct computation (12)(23) = (123) € HK, so HK is 
not a subgroup of G. 


There is a neater argument for (ii). One can check easily that |H K| = 
4. Hence HK ¢ G, by Lagrange’s theorem. This argument will become 
trivial after we obtain a formula for |HK|, which is reminiscent of the 
familiar formula for the dimension of the sum of two subspaces of a vector 
space. 


THEOREM 13. If H and K are subgroups of G, then 


wer = ELL, 
lH K| 

Proof. We shall prove |HK||H K| = |H||K|, by showing that every 
element of HK can be written precisely |H M K| distinct ways in the 
form hk, for h € H and k € K. Indeed let t = |HM K|, and write 
HOOK = {a,...,a+}. For any hk in HK where h € H andk € K, we 
put h; = ha; € H and k; = iak € K; then hjk; = hk for 1 <i < t. 
On the other hand if h’ € H and k' € K satisfies h'k’ = hk, then 
hth! = k(k')-! € HO K and so equals some a;, proving h’ = ha; = h; 
and k! = (k-4a;)-1 =a; 'k=k;,, as desired. O 


CoROLLARY 14. If Hy,...,H, are subgroups of G with H;H; = H;H; for 
each i,j, then Hy, ...H; < G and |H,...H,| divides |Hy|...| At]. 


Proof. Easy induction on t, the case t = 2 being obvious from the theorem. 
Let K = Hy... Hy_1. By induction K < G and |K| divides |H,|...|Hz~1|. 
But H,...H, = K H;, so |H,... H;| divides |K||H;|, which in turn divides 
|H,|...|H,|.0 


The Classical Groups 

Let us digress for a moment, to introduce certain examples of groups of 
utmost importance in mathematics, although many of them are not finite. 
A group is called a linear algebraic group if, for suitable n and a suitable 
field F, it is a subgroup of GL(n, F) that is defined in terms of (polynomial) 
equations in the matrix components. This definition really should be framed 
with more precision, but let us illustrate it with some examples. 


Example 15: Some linear algebraic groups 


(1) The general linear group GL(n, F) is itself a linear algebraic group. 
(2) The special linear group SL(n, F) is defined as the set of n x n 
matrices with determinant 1. 


(3) The orthogonal group O(n, F) = {A € GL(n, F) : At = A~*}, where 
* denotes the matrix transpose, i.e., (a;;)' = (a;;). (Note that if 
A € O(n, F), then det(A) = +1.) (This definition is rather naive, 
and should be given in terms of quadratic forms on vector spaces.) 
(4) The special orthogonal group 


SO(n, F) = {A € O(n, F) : det(A) = 1}. 
(5) The symplectic group 


Py ys -I 
Sp(2n, F) = {A € GL(2n, F): At = (35) At a an 
where 0 denotes the n x n matrix all of whose entries are 0, and I 
denotes the n x n identity matrix. 


These groups are called classical groups (also cf. Exercises 12.9 ff), and 
each has special geometric significance. Recall that GL(n, F) is the group of 
all invertible linear transformations of the n-dimensional vector space F”; 
SL(n, F) is the subgroup of transformations preserving the volume of any 
parallelotope defined by n given vectors. 

F” can be provided with the usual inner product: 


(Q1,.-+,0n) *(Prs-+++Pn) = (arf +-+> + AnPn). 


A € O(n, F) iff A preserves orthogonality, t.e., v-w = 0 implies Av- Aw = 0. 
Sp(2n, F) can be described in terms of an analogous orthogonality condition 
with respect to a different scalar product. 

If F is a finite field, for example Z,, then the classical groups are 
important examples of finite groups. 


Exercises 


1. (Cauchy’s theorem for p= 2.) Let T= {a € G: a7! Za}. Pairing 
off each a with a7! in T, show |T| is even; hence there are an even 
number of elements a in G for which a = a7}, i.e., a2 =e. All of 
these except e itself have order 2. 

2. (Special case of Frobenius’ theorem, cf. exercise 11.12) For any 
prime p dividing |G], the number of subgroups of order p is con- 
eruent to 1 (mod p). (Hint: Let m be the number of subgroups of 
order p, and let ¢ be the number of elements (in G) of order p. By 
Lagrange, the intersection of any two subgroups of order p is trivial, 
so t= (p—1)m=-—m (mod p). Conclude using Remark 1'.) 


10. 


11. 


12. 


Conversely to Proposition 10, show that if Hk < G, then Hk = 
KH. (Hint: Use the inverse.) 

Find a noncyclic Abelian subgroup of $4. 

Another proof of Wilson’s theorem (Exercise 2.6). The number of 
cycles of length p in S, is (p — 1)!, so the number of elements of 
order dividing p in S, is (p — 1)!4+ 1. But this is a multiple of p. 
Generalize proposition 5, to show that if a,b are elements of order 2 
whose product also has order 2, then ab = ba. 

Exp(S,) =Icm{1,2,...,n}. (Hint: It is enough to check the cycles.) 
What is the maximal possible order of an element of S7? of S,, for 
n arbitrary? 


The Classical Groups 

Display SL(n, F), O(n, F), SO(n, F), and Sp(2n, FP) explicitly as 
subgroups of GL(n, F), defined by equations in the matrix entries. 
Show SL(n, Z,) has order (p”—1)(p”—p)...(p" —p”~*)p”~ 1. What 
are the orders of O(n, Z,), SO(n, Z,), and Sp(2n,Z,)? (Hint: See 
Exercise 1.6. For O(n,Z,) compute the number of orthonormal 
bases; this is quite difficult.) 

(The group of symmetries of the circle) A symmetry of the circle is 
a rigid map from the circle to itself, permitting “flips.” Show that 
the set of symmetries is a group, taking composition of maps as 
the group operation. Identify this group with O(2,R). SO(2,R) is 
a subgroup of O(2,R), of index 2, corresponding to the group of 
rotations. 

For any n, the group of rigid transformations of a regular polygon of 
n sides can be viewed as a subgroup of the group from the previous 
exercise. (Hint: Inscribe the polygon in the circle.) 


CHAPTER 4. INTRODUCTION TO THE CLASSIFICATION OF GROUPS: 
HOMOMORPHISMS, ISOMORPHISMS, AND INVARIANTS 


Having proved some rather deep theorems about subgroups, let us pause 
and take stock of what we know about specific groups. We have applied 
Euler’s groups directly to number theory. We have also seen groups aris- 
ing in other contexts, such as permutations of sets of objects, invertible 
linear transformations of vector spaces, and symmetries of the circle (and 
possibly other geometric objects). Groups thereby have acquired a special 
significance, and deserve to be studied in their own right — the goal of such 
a study being to develop the tools to answer any question posed about a 
given group G. The most basic question is, “Does G have the same struc- 
ture as a group we have already encountered?” or, in other words, “When 
are two groups the same?” To make this question precise, we consider the 
group structure more closely. Of course, the structure of the group is de- 
termined by its “multiplication table,” the list of products of all pairs of 
elements. Here are some examples of multiplication tables of groups: 


TbIWk41 TblWk42 TbIWk43 
TblIWk44 TbIWk45 
Note that the groups of the first row all have the form 
TbIWk46 


where e is the neutral element and a is the other element; however the 
structures of (Z4,+) and Euler(8) differ, as seen by examining their main 
diagonals. Thus we see that the multiplication table provides a comprehen- 
sive method of comparing structures of different groups (also cf. Exercise 1). 
But for precisely this reason, the multiplication table is far too cumbersome 
in most situations, and we must find a more concise method of comparing 
group structures. The most direct approach is to find a correspondence 
of elements that will respect the algebraic structure of the groups. Even 
when the correspondence need not even be 1:1, it can still transfer valuable 
information. We have already come across such an instance, namely the 
function Z > Z,,, which sends a to [a]. Let us generalize this example. 


Definition 1. A group homomorphism yp: G1 — G2 is a function satisfying 
(H1) y(ab) = A eae ieees 


(H2) y(e1) = e2, where e; is the neutral clement of G;; 


(H3) y(a7 i, (a)! for all a in G4. 
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Remark 2. Conditions (H2) a 
holds then e.p(e1) = yle1) = 
canceling y(e,)); then g(a7+)y 


g(a") = g(a). 


nd (H3) are superfluous! Indeed if (H1) 
y(ej) = (er), so eg = (er) (seen by 
(a) = y(a-'a) = y(e1) = eg, proving 


From now on we write e for the neutral element both of G; and of G2, 
since it is clear from the context. 


Digression 3. For the sake of completeness let us record what happens for 
monoids. Monoid homomorphisms are required to satisfy only (H1) and 
(H2). ( (H3) is no longer relevant, since inverses are not part of the defining 
structure.) But now (H2) is no longer superfluous! For example, we could 
define vy: (Z2,-) > (Ze,-) by g(a) = 3a; then (H1) holds but (T12) fails since 
p(1) =3 #1. (Where does the proof of remark 2 go awry here?) 


Definition 4. An isomorphism is a homomorphism y:G, > G» for which 
the inverse maps exists and also is a homomorphism. In case an isomor- 
phism exists from G, to G2, we say “G, is isomorphic to G2” and write 


Gi 2 Go. 


Some philosophy: Suppose y: G, 4 G» is an isomorphism. For any a 
in G, we could write a for g(a) in G2, and then note that the multiplication 
table for G, could be copied for Gz by writing bars over all the elements. 
Thus “isomorphism” is the key concept for identifying the structure of 
two groups, and our immediate goal should be to find the weakest formal 
conditions to guarantee that a given homomorphism g is an isomorphism. 
Clearly gy must be 1:1 and onto, so we first note 


Remark 5. Any 1:1, onto homomorphism vy: G, — G» is an isomorphism. 
(Indeed we must show g~1(ab) = y~1(a)y71(6) for any a,b in G. But 


(ye *(ab)) = ab = pp *(a)pe *(b) = v(y *(a)y *(6)): 
since y is 1:1 we see » 1! (ab) = ep Na)y 1 (0).) 
This leads us to separate the key features of isomorphism into two 
parts: 


Definition 5'. An injection is a 1:1 homomorphism; a surjection is an onto 
homomorphism. 


How can we tell whether a homomorphism is an injection? The sign 
of a good theory is that it provides guidance at times of need. At this 
juncture we note that any given homomorphism y:G, — G» naturally 
gives rise to a certain subgroup of G1. It is convenient to use the following 


set-theoretic notation: Given a function y:G; 3 G, and S C Gp» write 
gp 'S for {g € Gi : p(g) € Sf. 


Definition 6. The kernel of a homomorphism y:G, > G2, denoted ker y, 
is {a € Gi: p(a) =e} = “{e}, where e is the neutral element of G2. 


Remark 7. ker is a subgroup of G1; this is left as an exercise for the 
reader. (This remark will be improved in chapter 5.) 


Certainly if y is 1:1, then ker gy = {e}; the surprise is that the converse 
is also true. 


Remark 8. If ker p = {e}, then ¢ is an injection. In fact, for any homomor- 
phism ¢, if ¢(g1) = ¢(g2) then p(gi9y') =e, so gigs’ € ker, implying 
n € (ker y) go. 


Thus we see that an isomorphism is a surjection whose kernel is trivial. 
The task remains of determining that two given groups G; and G» are 
isomorphic, by constructing the isomorphism; we shall return later to this 
issue. Now we want to deal with the possibility that G,; and G2 are not 
isomorphic. Obviously we do not want to have to check that every map 
from G to Gy» is not an isomorphism, so we look for a subtler approach. 


Definition 9. An invariant is a number associated to a group, which is the 
same for isomorphic groups. 


Example 10. |G| is an invariant. Indeed if y:G1, 4 G» is an isomorphism 
then |G1| = |G2| since y is 1:1 and onto. 


The exponent of G is also an invariant, as we shall see from the fol- 
lowing fact: 


Remark 11. If g:G, + G2 is a homomorphism, then o(y(g))|o(g) for all 
g in Gy. (Indeed if n = o(g) then y(g)” = y(g") = e, so o(y(g))|n by 


Proposition 2.6.) If y is an isomorphism then o(y(g)) = o(g), seen by 


applying this observation also to y7!. 


PROPOSITION 12. Exp(G) is an invariant. 
Proof. Suppose y:G, > G, is an isomorphism. Then 


exp(G1) = lem{o(g) : g € Gi} = lem{o(y(g)):g € Gi} 
=exp(G2). O 


For example Euler(8) has exponent 2 and thus cannot be isomorphic 
to (Z4,+). We shall introduce new invariants as the occasion requires. 


Homomorphic Images 

Recalling that homomorphisms can relate two groups that are not nec- 
essarily isomorphic, we are interested in information concerning arbitrary 
homomorphisms, including what happens to subgroups. First we introduce 
an important subgroup of G2 arising from a homomorphism y:G,; > G2, 
namely the image (G1), and obtain a key generalization of Example 10. 


PROPOSITION 13. Ify:G, % G2 is a homomorphism of finite groups, then 
(G1) is a subgroup of Gz, and |y(G1)| = [G1 : ker ¢]. 


Proof. y(91)(92)~' = y(q93 *) € (G1) so y(G1) < Go, proving the first 
assertion. Recall that [G1 : ker y] is the number of cosets of ker y, which it 
remains to prove equals |y(G1)|. Letting K = ker y we claim that induces 
a 1:1 correspondence : {cosets of K} — (Gi), given by Kg 4 ¢(g). 
Indeed © is well-defined and onto, since y(ag) = y(a)y(g) = v(g) for any 
ain K; also, ® is 1:1 by Remark 8. O 


This result justifies our intuition that ker y is the information “lost” 
in applying the homomorphism y. Let us use this approach to determine 
the subgroups of (Z, +). We start with a key observation. If g: G1 > G 
is a group homomorphism and H < G,, then, viewing the action of y on 
HA only, we get a homomorphism y: H > G2, called the restriction of y to 
H, denoted lz. 


PROPOSITION 14. Suppose y: G1 > G, is a group homomorphism. 
(i) If H < Gy, then p(H) < Go. 
(ii) Conversely if Hy < G2, then p~'(Hp2) is a subgroup of G, containing 
ker yp. 
(iii) If y is onto, then |H2| = [p~‘(H2) : ker g] for all Hy < Go. 


Proof. (i) By Proposition 13 applied to »|z. 
(ii) kery = yp t{e} C py '(A2). To see that gp 1(H2) < Gi, take 
a.b € g7!(Hz); then y(ab~!) = p(a)y(b)7! € Hy, so ab! € p(y). 
(iii) Apply Proposition 13 to the restriction of y to gy '(A2). 0 


This useful result can be reformulated more concisely. 


PROPOSITION 15. Suppose yp: G1 > G» is a surjection. Then » induces a 
1:1 correspondence 


®: {subgroups of G, containing ker p} — {subgroups of G2}, 


and ®~! is induced by y7!. 


Proof. One must show that y~!w(H,) = A, for any Hy, < G, containing 
ker y, and yp71(H2) = Hy» for any Hz < Gz. The second assertion is imme- 
diate since y is onto; the first assertion is clear, for if a € p~ty~(AM,), then 
(a) = y(b) for some b in Hy, implying a € (ker y)b C Hy, by Remark 8.0 


COROLLARY 16. The subgroups of Z,, are all of the form nZ,,, for suitable 
nim. 


Proof. We apply Proposition 15 to the natural surjection y:Z > Z,,. The 
subgroups of Z containing ker p = mZ have the form nZ for some n|m in 
N, and the image is nZ,,. O 


COROLLARY 17. For any m, Z,,, has a unique subgroup of order n, for each 
n dividing m. 


Proof. Tt must be Zm, in view of Proposition 14(iii). O 


Exercises 


1. Each row (and each column) of the multiplication table of a group 
is in 1:1 correspondence with the set of elements of the group. 

2. Write a computer program to find all groups of a given order n, 
but throwing away all “duplications,” «.e., when two groups are 
isomorphic. Use this program to construct all groups of order < 8. 
What interesting facts can you conjecture from this list? As the 
course progresses, improve this program to take into account the 
various theorems. 

3. Define a homomorphism GL(n, F) > F\{0}, given by “det.” What 
is the kernel? 

4. Define a homomorphism O(n, F) — {41}, given by “det.” What is 
the kernel? 

5. Use Exercises 3,4 to calculate the orders of SL(n, Z,) and SO(n, Z,). 

6. (cf. example 2.4") Construct a natural homorphism from the group 
of invertible upper triangular matrices to the group of diagonal ma- 
trices. What is its kernel? 

7. Any monoid homomorphism f:M, > Mz restricts to a group ho- 
momorphism f:Unit(M@,) > Unit (M2). 

8. If m|n, then there is a group homomorphism Euler(n) > Euler(m) 
given by [a] + [a]. What is the kernel? 

9. Suppose there is a surjection f:G,; > G2. If Gy has an element of 
order m, then so does G,. (Hint: Apply remark 11.) 


CHAPTER 5. NORMAL SUBGROUPS — THE 
BUILDING BLOCKS OF THE STRUCTURE THEORY 


We have seen that the kernel of a group homomorphism y:G > 4 is a 
subgroup of G. Our object here is to study a special property of this kind 
of subgroup, which turns out to be the key to the whole structure theory. 


Remark 1. If a € ker gy and g € G, then gag™! € ker vy. Indeed, 
y(gag ') = p(gey(g ') =e. 


Let us formalize this property. 


Definition 2. A subgroup N of G is normal, written N «G, if gag~! € N 
for alae N, g EG. 


We call an element of the form gag~! the conjugate of a by g; thus 
N is normal iff N contains every conjugate of each element of N. This 
property can be written more suggestively as gNg~! C N for all g in G, 


1 


where gNq7! denotes {gag~! : a € N}. the set of conjugates of elements 


of N by g. 


PROPOSITION 3. The following conditions are equivalent: 
(i) N«G; 
(ii) gNg7! CN for all g in G; 
(iii) gNg~! =N for all g in G; 
(iv) gN = Ng for all g inG. 


Proof. (i) & (ii) by definition. 

(ii) = (iii) We need the opposite inclusion to (ii); using g~* instead 
of g in (ii) we get g ‘Ng =q 'N(g'') | CN, so N = gg 'Nqq' = 
gg *Ngjg'' CgNg'. 

(iii) > Gi) Obvious. O 

(iii) & (iv) Clear (multiplying on the right by g or g~'). 0 


1 


Example 4. GG and {e}4G, for any group G. These are called the trivial 
normal subgroups of G, and we shall be interested in the nontrivial normal 
subgroups. On the other hand any subgroup of an Abelian group is normal, 
so the question of normality is pertinent only for nontrivial subgroups of 
non-Abelian groups. 


Example 5. G = S3 (the smallest nonabelian group). In Chapter 3 we saw 
that the nontrivial subgroups of S3 are ((12)), ((13)), ((23)), and ((123)). In 
the appendix to this section we shall see by inspection that ((12)), ((13)), 
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and ((23)) are not normal subgroups. One can also check that ((123)) is a 
normal subgroup by direct computation, but we shall now see two ways of 
obtaining this information with ease. 


LEMMA 6. For any H < G and any g in G, gHg7! is also a subgroup of G, 


and there is a group isomorphism ¢: H — gHg™ given by ¢(a) = gag". 


Proof. If gag~', gbq7! € gHg™' then 


gag *(gbg-*)* = gag *gb-*g°' = gab“*'g"* € gHg"*, 
proving gHq~! < G by our usual criterion of subgroup. Now ¢ is a group 
homomorphism, since 


1 


y(ab) = gabg* = (gag *)(gbg *) = vla)y(o). 


1 


and ¢ is obviously onto. Finally ker »y = {a € H : gag~* = ec} = {e}, seen 


by multiplying by g~! on the left and g on the right. 0 


Stated in words, conjugation by any element of G yields an isomor- 
phism of subgroups. 


PROPOSITION 7. Suppose G has a unique subgroup H of order m. Then 
AAG. 


1 


Proof. For any gin G, gHg™! is a subgroup of order m, so gHg-' = H.O 


CoROLLARY 7’. If p is prime and |G| = pu for u < p, then G has a unique 
subgroup of order p, which thus is normal. 


Proof. By Cauchy’s theorem G has a subgroup A of order p. H is unique 
(of order p), since if K # H with |K| =p, then HM K is a proper subgroup 
of H, which must have order 1 by Lagrange; hence p? = |H||K|/|Hn K| = 
|HK| <|G| < p’, contradiction. Thus H<4G.0 


Example 8. 6 = 3-2. Hence ((123)) is the unique subgroup of 53 having 
order 3, and so is normal. 


Here is an even easier approach. Recall that [G : H] = $ when |G| 


is finite. 
PROPOSITION 9. If [G: H] =2, then H«G. 


Proof. For any g in G we need to show Hg = gH. If g € H then Hg = 
H = gH, so we may assume g ¢ H. But then A and Hg are the only 
two (right) cosets of H in G, by Lagrange, implying G = H U Hg; and 


analogously using left (instead of right) cosets we have G = H U gH, both 
disjoint unions. Thus Hg = G\ H = gH, as desired. 0 


Note that [$3 : ((123))] = 2, yielding ((123)) <4.S3. This nice result will 
be generalized in Chapter 9. 


Remark 10. If H < Gand N«G, then HN < G, as a special case of 
Proposition 3.10. (Indeed hN = Nh for all h in H,so HN = NF.) 


Without computing, one concludes at once from this result that the 
subgroups of $3 having order 2 are not normal, cf. Exercise 1 (which will 
be generalized in chapter 11). 


The Residue Group 
So far we have seen that ker p4G, for any group homomorphism y:G > K. 
Now we want to show the converse, that every normal subgroup is the kernel 
of a suitable group homomorphism. In other words, given N <G we need to 
construct a group K and a homomorphism y:G > K such that kerg = N. 
When G is finite, |g(G)| = [G : N] by Proposition 4.13. On the other hand, 
N has [G : N] cosets in G, and this fact was instrumental in proving that 
proposition. Thus it makes sense to try to define a group structure on the 
cosets of N as our candidate for K. The obvious operation on cosets would 
be 
NaNb = Nab (1) 

if this can be shown to be well-defined. So suppose Na = Na’, and Nb = 
Nv! for a',b' in G; is Nab = Na'b'? Write a’ = xa and b! = yb for zy 
in N. Then a’b! = xayb = x(aya7")ab € Nab, since aya~! € N. Hence 
Na'b! = Nab by Proposition 2.10. 
Definition 11. G/N denotes {cosets of N}, endowed with the operation 
defined in (1). 

It must be stressed that in G/N the cosets Na play the role of ordinary 
elements and should be considered as such. 


THEOREM 12. G/N is a group, with neutral element Ne. Furthermore, 
there is a surjection »:G— G/N given by at Na, whose kernel is N. 


Proof. To prove G/N is a group, we simply transfer the analogous proper- 
ties from G to G/N. For any a in G we have 


NaNe = Nae = Na = Nea = NeNa, 


proving Ne is the neutral element. Associativity follows from the compu- 
tation 


(NaNb)Ne = NabNce = N(ab)c = Na(bc) = NaNbe = Na(NbNe). 


Finally, (Na)~! is Na~? since 
Na 'Na= Na~‘a= Ne = Naa! = NaNa'. 


Having proved G/N is a group, we next check y:G > G/N is a group 
homomorphism: 


(ab) = Nab = NaNb = )(a)4)(b). 


For the last assertion, kery) = 7)~1(Ne) = {a € G: Na = Ne} = N, by 
Remark 2.9’. 0 


The group G/N is called the residue group, factor group, or quotient 
group. The homomorphism of Theorem 12 is called the canonical homo- 
morphism. Let us now consider three instances when N 4G and determine 
the structure of the group G/N. 


Example 18. 
(i) G=(Z,+) and N = nZ. Then 


G/N ={N+0, N+1, ..., N+(n—-1)}, 


with the group operation given by (N +a)+(N+6)=N-+(a+0). This 
eroup is identified naturally with (Z,,,+) if we write [a] in place of N +a. 
(ii) G = (Ze,+) and N = 3Z, = {[3a] : a € Z}. Then G/N = 
{N+ [0], N+[1], N+ [2]}, which is identified naturally with (Z3,+). 
(iii) G= S; and N = ((123)). Then G/N = {Ne, Ng} where g = (12). 
Note that NgNg = Ne. Thus we can identify G/N with ({+1},°). 


Please note that (i) provides an instance where G/N is not isomorphic 
to a subroup of G. (This observation should help one to avoid a common 
error.) Our next goal is to view (i), (ii) in their general contexts. 


Noether’s Isomorphism Theorems 

Suppose y:G —> K is a surjection, and N = ker wy. We have just seen that 
N<«G, and G/N is a group whose order is [G : N] = |K| when K is finite. 
We would like to show that G/N = K. Given any N«G and N CH <G, 
write H/N for {Nh:h € H}, easily seen to be a subgroup of G/N. 


LEMMA 14. Suppose y:G > K is any group homomorphism, and N is a 
normal subgroup of G contained in ker gy. Then there is a homomorphism 
p: G/N > K given by @(Nq) = v(g), and ker ¢ = (ker vy) /N. 


Proof. @ is well-defined, for if Ngi = Ngo, then gi = ag2 for some a in N, 
and thus 


(Nagi) = (a1) = vlag2) = p(a)y(g2) = P92) = (N92). 


The rest is easy: y is a homomorphism since 


P(NaNgz) = O(Nq192) = (9192) = (91) (92) = O(N g1) (Na). 
and kerg@ = {Ng : o(g) =e} = {Ng : 9 € kery} = (kery)/N by defini- 
tion. O 
Remark 15. In Lemma 14, ¢ is onto iff ~ is onto. 

THEOREM 16. (Nocther I) Suppose »:G > K is any surjection. Then 
K & G/ ker y. 

Proof. Take N = kery in Lemma 14. Then ¢:G/N —> K is onto, and 
ker 6 = N/N = Ne, the neutral element of G/N, so ¢ is an isomorphism. 0 
Example 16'. The natural surjection (Z,+) > (Zn,+) has kernel nZ, so 
(Z[nZ,+) & (Zn, +). (Compare with Example 13(i).) 

THEOREM 17. (Noether II) If N C H are normal subgroups of G, then 
G/H =~ (G/N)/(H/N). 


Proof. The canonical homomorphism y:G — G/H is onto and has kernel 
H containing N, so Lemma 14 provides a surjection ¢:G/N > G/H with 
kernel H/N; we conclude with Noether I. 0 


Example 17'. Define G = Z, N = 6Z, and H = 3Z. Then Z/3Z = 
(Z./6Z)/(3Z/6Z) & Ze6/3Z—. (Compare with Example 13(ii).) 

THEOREM 18. (Noether III) If N«G and H < G, then HN «FH and 
H/ HON = NHN. 


Proof. Define yp: H ~ NH/N by hw Nh. Clearly ¢ is onto, and ker ¢ = 
{he H:Nh=ech={he€ H: he N} = HON, so we conclude with 
Noether I. 0 


For any H < G, the proof of Theorem 18 shows that NH/N is the 
image of H in G/N under the canonical homomorphism G > G/N. In 
particular we have 


Coro.tiary 19. If N«G and H < G with HON = {e}, then FH is 
isomorphic to its canonical image in G/N. 
Proof. NH/N = H/(HON)=H/{e} x 7.0 


These three theorems of Noether (especially Nocther I) are powerful 
tools for building isomorphisms. Let us also record an application of Propo- 
sition 4.15 to G/N. 


Remark 19’. Suppose N<G. If N < H < G, then H/N < G/N; conversely 
every proper subgroup of G/N has the form H/N for N < H < G. 


Conjugates in S;, 

The computation of conjugates (used in the criterion for normality of a 
subgroup) is surprisingly straightforward in S,, and we shall describe it 
here. 


Remark 20. Suppose o,a are arbitrary permutations, and write o as a 
product of disjoint cycles 


o = (41... 44, (te, 41 -- Bty Utd ee ttg) eee (2) 
In any given cycle, we see that oi is the entry following 7. Now observe 
what maz~* does to mi. 
non !(wi) = (a7); i.e., (ai) follows az the same way that a7 fol- 
lows 7. Thus we see mam! is the product of disjoint cycles 


(arty até «Tit, )( Wh, 44 28 Tt, (Wet, 44 a . Tit, ) ayia e (3) 


This computation will be the key to many observations about per- 
mutations. In particular we see that each conjugate of o can be written 
as a product of disjoint cycles of the same respective lengths as the cy- 
cles of o. More colloquially, we say o has the same placement of paren- 
theses as each of its conjugates. On the other hand, if o is as above and 


T = (j1--- Ft: )(Ger41- ++ Sta )Gta41--+Jtg)-.-, Le., T has the same placement 
of parentheses as a, then tT = tam! where 7 = é: . > tae i) . Therefore, 
fal des. 2s 


the conjugates of o are precisely those permutations that can be written as 
a product of disjoint cycles with the same placement of parentheses. We 
shall consider this issue in more detail in Chapter 10, when conjugacy is 
used to define an equivalence relation. 

Now we apply this argument to determine the nontrivial normal sub- 
groups of S3 and $4. (13)(12)(13)~! = (32) ¢ ((12)), which thus is not a 
normal subgroup of $3; analogously ((13)) and ((23)) are not normal sub- 
groups. On the other hand every conjugate of (123) is a cycle of order 3 
and thus can be rewritten as (123) or (132) = (123)?, and so belongs to 
{(123)) ; this proves (for the third time) ((123)) 453. 

In 4, every permutation is conjugate to either (1234), (123), (12), (1), 

r (12)(34). In fact the set of conjugates of (12)(34) is precisely 


{(12)(34),  (13)(24),  (14)(23)} 
(since for example (42)(13) = (24)(13) = (13)(24)): adjoining (1) to this set 
yields a subgroup K of four elements called the Klein group. Since every 
conjugate of each element of K is in K’, we see in fact K «$4. Note that 
exp(K) = 2. 
S4, has one other nontrivial proper normal subgroup, which we now 
consider. 


The Alternating Group 

We use these ideas to construct an interesting nontrivial normal subgroup 
of S,,, in fact the only one for n > 4. Any permutation m can be written as a 
product of disjoint cycles C, ...Cy; any cycle C = (a1 ...%;) can be written 
as the product of t — 1 transpositions (21 74)(41 a1)... (41 22). Hence any 
permutation a can be written as a product of m transpositions, for suitable 
m. Note that m is not well-defined, since (12) = (12)(12)(12). However, we 
shall see now that m is well-defined (mod 2). Thus we can define the sign 
of a, denoted sg(a), to be (—1)™; is called even (resp. odd) according to 
whether m is even (resp. odd), t.e., whether sg a = +1 (resp. = —1). 


THEOREM 21. There is a (well-defined) homomorphism S, > ({+1}.-), 
given by m+ sg(7). 


Proof. First define 


(a) = [J 


et. Cae 
w>J J 


We claim (ma) = 4(a)¢)(c) for all permutations 7,0. Indeed, 


pay yy a diate 


Lo 01 — Oo Lon 
>j J i>j Yay J 


(since o merely permutes the indices, and na = = ee 
In particular, x is a homomorphism, so it remains to show (a) = 
se(a). First we check it for 7 a transposition; to simplify notation we assume 


= (12): 


=—=1 for i97>2. 


Multiplying these three equations together for all possible 7, 7 yields 


((12)) == =(-1)-1-1=-1. 


i 
t>y J 


In general, writing an arbitrary permutation 7 as a product of m trans- 


positions 01...0m, yields ob(m) = (01)...4b(om) = (—1)™” = sg(7), as 
desired. 0 


There are proofs of Theorem 21 which are more intrinsic to the defini- 
tion of permutation, cf. Exercises 3,4. 


Definition 22. Ay, the alternating group, is the kernel of the group homo- 
morphism 4 of Theorem 21, 7.e., A, = {even permutations of S',}. 


Remark 23. An< Sp, being the kernel of a homomorphism. Furthermore, 

ae ‘ye : : — |Srn} _ nt 
Sn/An & ({£1}.-) by Noether I. Proposition 4.13 implies |A,| = 32 = =. 
Thus |A3| = 3, implying A; = ((123)) & (Z3,+). However A, is new. Let 


us list its 12 elements: 
(123) (132) (124) (142) (134) (143) (234) (243) 


(1) (12)(34)  (13)(24) (14)(23) 


Each element on the top row has order 3, whereas the bottom row comprises 
the elements of the Klein group, which has exponent 2. Thus exp(A4) = 6. 

One interesting feature of A, is that it has no subgroup of 6 elements 
(and thus is the promised counterexample to the converse of Lagrange’s 
theorem). Indeed suppose N < A, had order 6. Then [Ay : N] = 2 so 
N «A, by Proposition 9. But N would have an element of order 2, which 
we may assume is a = (12)(34). Then (14)(23) = (123)a(123)"' € N 
and (13)(24) = (124)a(124)~' € N, implying N would contain the Klein 
eroup (of order 4), which is impossible by Lagrange’s theorem, since 4 does 
not divide 6. Other properties of A, are given in Exercises 4 and 6, and 
Example 12.5. The most important property in this course is the following: 


THEOREM 24. A,, is the only nontrivial proper normal subgroup of S,,, for 
each n > 5. 


Proof. Suppose {e} # N<S,. We shall show A, C N, and thus N = A,, or 
S,, by Remark 19’. It suffices to show that N contains every product 7 of 
two transpositions, since every even permutation is a product of these. But 
any such 7 has the form (21 @3 73) or (41 22) (23 24). In fact it is enough to show 
that each (71 22 73) € N, since then (21 4 23)(d1 %2 23) = (21 22) (43 t4) € N. 
We shall prove that (123) € N; analogously each (21 22 73) € N. 

Write any o # 0 in N as a product of disjoint cycles C\C,...C;, ar- 
ranged in decreasing length. Write C, = (i172 ...%m). 


Case I. m > 3. Let Cy = (tgt1%3..-im). Then CiC,...C; has the same 
placement of parentheses as 0, so is a conjugate 7 of a, and thus is in N. 
Hence 

to 1 =CiC{' = (irisis) € N, 


so N contains (123), as desired. 


Case IT. m = 2. Then t > 2, and Cy = (t172), Cy = (t3t4). Let Cl = (4123) 
and C} = (i274). Then C}C3C3 ...C; has the same placement of parentheses 
as 7, so is a conjugate 7 of a, and thus is in N. Hence 


ta — C5080? = (4124) (2923) € N, 


so N contains (13)(45) and (12)(45), and thus also their product (123), as 
desired. 0 


Note that we needed n > 5 to escape from the Klein group in Case II. 


Exercises 


di 


If G has at least two subgroups of prime order p, and p? does not 
divide |G], then no subgroup of order p is normal. (Hint: Otherwise 
HK < Gof order p?.) Find an analogous assertion for higher powers 
of p. 


. If N«G and g € G has order n, then Ng (viewed as an element of 


G/N) has order dividing n. Give an example where equality does 
not hold. 


S, and A,, 

There are many proofs of Theorem 21 in the literature. Here 
are two good alternative approaches; Exercise 4 contains the fastest 
proof that I know, but is a bit tricky 
Any permutation a can be written as a product of disjoint cycles 
C,...Cu, whose lengths t,,...,¢, (when written in ascending or- 
der) are unique. Define é(7) = t) +--+: +t, — wu, which is clearly 
well-defined, and show 7 4 (—1)") is a homomorphism. (Hint: 
By induction, one need merely check the product of a cycle and a 
transposition.) 

Show directly that no permutation is both even and odd. (Hint: 
Otherwise one could write e = (a b)(c d)... as a product of an 
odd number of transpositions; of all such products beginning with 
a, take that one in which a appears the smallest possible number of 
times, and such that the appearances of a are as far to the left as 
possible. Since (7 a) = (a 7) we may assume a never appears on the 
right side in a transposition. Clearly a appears at least twice. But 
(2 7)(a 2) = (a7 t) = (a7)(% 7), so the product must start (a z)(a 7); 
then (a z)(a 7) = (a j 7) = (a 7)(4 9), which has fewer occurrences 
of a, contradiction.) 

The only nontrivial normal subgroup of A, is the Klein group. 


If ()4N<«A, for n > 5 then N = A,. (Hint: Mimic the proof of 
Theorem 24. It is enough to show (123) € N; take care to conjugate 
by even permutations only.) 

. If H < S, and A contains every transposition, then H = S',. 

. If H is a subgroup of S, containing g = (12... n) and 7 = (1 2), 
then H = S,. (Hint: o'ta~* = (i i +1) € H. But then one has 
(@t+1)(@-1ia)Gt4+1) = (¢-—1i4+1) € A. Continuing in this way 
produces all the transpositions.) 

. When do two permutations ¢ and 7 commute? (Hint: When does 
oto + = 7? Be careful to treat cases such as o = (12)(34) and 


7 = (13)(24).) 


CHAPTER 6. CLASSIFYING GROUPS — 
CYCLIC GROUPS AND DIRECT PRODUCTS. 


The classification problem for finite groups is the question of how to 
list all finite groups “up to isomorphism,” 7.e., one representative for each 
class of isomorphic groups. Having such a list, presumably we could verify 
assertions about groups simply by going down the list. However, there are 
several difficulties with this approach. First of all, the list must be infinite, 
since (Zm,+) is a finite group for each m. Moreover, even if we had a 
complete list of finite groups, it might be impossible in practice to verify a 
given assertion. For example, 1,2,3,... is a list of the natural numbers, but 
we do not know if there are any odd perfect numbers. Finally, the list of 
finite groups might be less enlightening than the body of theorems used to 
obtain the list. Nevertheless, the classification problem is the focal research 
problem in the theory of finite groups, and the ensuing results have been 
of great use. 

Given a formidable problem such as classification, we would like to 
start by breaking it up into manageable parts. Thus we shall consider first 
only certain kinds of groups. 


Cyclic Groups 
Definition 1. A group G is cyelic if G = (g) for some element g in G. 


For example, (Z,+) = (1), since each positive number is a sum of 1’s. 
Likewise (Zn, +) is cyclic for each n. Surprisingly this gives the complete 
list of cyclic groups. 


THEOREM 2. (The classification of cyclic groups) Every cyclic group G is 
isomorphic to (Z,+) (for G infinite) or (Z,,+) (for G finite and n = |G|). 
More precisely, writing G = (g) we have a surjection yg : (Z,+)— G given 
by p(m) = g™. If kery = {0}, then y is an isomorphism; if ker p = (n) 
then G & (Zy,+). 

Proof. We prove the second assertion, which in turn implies the first. 

g(m +m!) = g™F™ = gg" = v(m)y(m') 

for all m,m! in Z, proving y is a homomorphism that clearly is onto. If 
kery = {0}, then g is an isomorphism, so assume kery # {0}. Then 


kery = nZ for some n, by Example 2.6’. But then Noether I implies 
Ge Z/nZ~ (Zy,+). 0 


(Zn,+) has taken on such importance that we can refer to it merely 
as Z,; the operation “+” is understood. Z, also is denoted C,, in the 
literature. 
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COROLLARY 3. If |G| =p is prime then G & Z,. 
Proof. G is cyclic by Corollary 2.14, so apply the theorem. O 
Here is an application of the classification of cyclic groups. 


COROLLARY 4. Every subgroup of a cyclic group G is cyclic. Furthermore, 
if G = (g) is cyclic and m divides n = |G|, then G has a unique subgroup 
of order m, namely (g"/™). 


Proof. We may assume G = (Z,+) or G = (Zn, +) (and g = 1), so we are 
done by Example 2.6’ and Corollaries 4.15 and 4.16. (The last assertion is 
clear since o(4) =m.) 0 


Generators of a Group 
There are noncyclic Abelian groups, for example, Euler(8). Toward under- 
standing such groups we need to generalize “cyclic.” 


Definition 5. A group G is generated by a set S, written G = (S), if 
G has no proper subgroups containing S. If G is generated by a finite 
set S={g1,..., 92}, we also say G is generated by the t elements g1,..., 91. 
G is finitely generated if G is generated by a finite set. 


Intuitively G is generated by S' iff every element of G can be written 
as an expression involving the elements of S', by means of the group oper- 
ations (the group product and inverse), in a manner to be made precise in 
Exercise 8.18. Of course G is generated by 1 element iff G is cyclic. 


Remark 6. (i) If S C ($") and G = (S), then G = (S"). 

(ii) If g:G > K is an onto homomorphism and G is generated by S, 
then y(G) is generated by y(S). (Indeed suppose y(S) is contained in a 
subgroup of ¢(G); this subgroup has the form y(H), where ker p < H < G. 
Clearly S C H implying H = G, so we conclude y(H) = v(G).) 


An example of a noncyclic group generated by two elements is Euler(8), 
which is generated by [3] and [5], since [1] = [3]? and [7] = [3] - [5]. On the 
other hand, obviously any group G generates itself, so every finite group 
is finitely generated. S, is generated by two elements, by Exercise 5.8, 
so we may wonder how to construct a group that cannot be generated by 
two elements. The answer will be trivial by the end of this discussion; the 
solution is to find a way of piecing together small groups. 


Direct Products 


Definition 7. The direct product of two monoids H and K is the Carte- 
sian product H x K, endowed with componentwise multiplication, 7.e., 


(hy, kt) (ho, ke) = (hyhe, ky ke). 


The verifications of the following straightforward facts hinge on check- 
ing each component. 


Remark 8. The direct product H x K of two monoids is a monoid, with 
neutral clement (e#,ex), and has the following additional properties when 
AT and K are groups: 

(i) H x K is a group, with inverses given by (h,k)7! = (h7!,k7?). 

(ii) There are natural injections H ~ H x K and K > H x K given 
respectively by h + (h,e) and k ® (e,k). 

(iti) There are natural surjections H x K + H and H x K — K given 
respectively by (h,k) H Ah and (h,k) H k. These are called the 
projections onto H and K respectively. 

(iv) If o(h) = m and o(k) = n in H,K respectively then o(h,k) = 
Icm(m,n). (Indeed let ¢ = o(h,k) and t! = lem(m,n). Then 
(h.k)* = (h* kt) = (e,e) so tlt’. On the other hand (e,e) = 
(h,k)t = (ht, kt) so ht = e and kt = e implying m|t and nt, and 
thus ¢'|t.) 

(v) |H x K|=|H||K| since this already holds for Cartesian products; 

(vi) exp(H x K) = lem(exp(H),exp()), by (ii) and (iv); 

(vii) If H, » Hy and Ky » Ky then H, x Ky = Hy x Ky. More precisely, 
ify: Hy, > Hy and w : K, > Ky are the given isomorphisms then 
the desired isomorphism y x y) : Hy, x Ky, > Hy, x Ke is given by 
(hihi) + (p(hi), (hr). 

(viii) If H and K are Abelian then H x K is Abelian. 


Example 9. Z2 x Zz is an Abelian group of order 4 and exponent 2 and 
obviously is not cyclic. 


We would like to push Remark 8(ii) a bit further, in order to identify 
Zo X Zo with Euler(8) and with the Klein group. 


Remark 10. Let H,K be the respective images of H,K under the homo- 
morphisms of Remark 8(ii). Then H = H x {e} and K = {e} x K are 
isomorphic respectively to H and K and are each normal subgroups of 
A x K. Furthermore the following properties hold: 
(i) HN K = {(e,e)}; 
(ii) HK =H x K (since (h.k) = (h,e)(e,k)): 
(iii) Every element of H commutes with every element of K (since 


(h,e)(e,k) = (h,k) = (e, k)(h,e)). 


In fact, assertions (i) through (iii) of Remark 10 characterize the direct 
product construction, as we shall see now. 


Internal Direct Products 


PROPOSITION 11. Suppose a group G has subgroups H and K satisfying 
the conditions 

DP1 HK = fe}; 

DP2 HK =G; 

DP3 Every element of H commutes with every element of K. 
ThenGrH x K. 


Proof. Define y: H x K > G by (h,k) 8 hk. pis a homomorphism since 


p((hi,k1)(h2,k2)) = p(hihe, kik) = (hih2)(kik2) 
= hy (ha ky )ko = hy (ki hz )ke 
= (hy ki) (hok2) = p(hi, ki) pe (ha, ka). 


Also is onto, by DP2. Finally ker gy = {(h,k) : hk =e}. But hk =e 
implies h = k71 € HN K = {e} by DP1; thus (h,k) = (e.e). proving ¢ is 
dey 


We shall say G is an internal direct product of subgroups H and K when 
the criteria of proposition 11 are satisfied. Note that DP3 is superfluous 
when G is Abelian, so in this case DP1 and DP2 suffice to prove G is 
isomorphic to H x Kk. Let us find further reductions in verifying an internal 
direct product. 


Remark 12. Suppose H and K are subgroups of a group G. 


(i) If DP1 holds and |G| = |A||K| then DP2 holds by Theorem 3.13, 
A\|K G 
since |H K| = aaa = 1g} = |GI. 

(ii) Suppose |G] = |H||K| with |H|,|K| relatively prime. Then DP1 


holds since |H M K’| divides (|H|,|A|) = 1; thus DP2 holds by (i). 


There is an interesting alternative formulation of DP3. 


PROPOSITION 13. Suppose H and K are subgroups of G satisfying DP1 
and DP2. Then H and K are both normal subgroups of G iff DP3 holds. 


Proof. (=) Take any h in H and k in K, and let g = hkh~tk7+. Then 
g = (hkh")k-! € K, but also g = h(khk71) € H; hence g EC HK = fe}. 
so hk = kh. 

(<=) We shall show ghg~! € H, for any g in G and h in H. Indeed 
write g = hyk, for hy in H and ky, in K, and note ghg"! = hk hk thy! = 
hyhhy* € H, proving H «4G. Likewise kK «G. 0 


The idea of using hkh~!k7+ will also be used extensively in Chapter 
12. Let us see some applications. 


PROPOSITION 14. Suppose G is a group of order 4 and exponent 2. Then 
Ge Vi x Zo. 


Proof. Gis Abelian, by Proposition 3.5. Take any two distinct elements 
a,b # e. By hypothesis o(a) = o(b) = 2, so (a), (b) are subgroups each 
of order 2, satisfying DP1 and DP3. But DP2 holds by remark 12, so 
G = (a) x (b) & Z. x Ze (since any group of order 2 is cyclic and thus 
isomorphic to Z2). O 


Example 15. Euler(8) and the Klein group are each isomorphic to Zz x Zo. 


Thus we begin to see that the direct product construction encompasses 
all of our examples of Abelian groups. 


Corotiary 16. If |G| < 6 then G is Abelian, and isomorphic to one of 
the following groups: {e}, Zo, Z3, Za, Z2 Xx Zo, Zs. 


Proof. Let n = |G. If n is prime then G is cyclic and isomorphic to Zp. 
Thus we may assume n is not prime, i.e.,n = 4 = 27. If G % Z4 then 
G is not cyclic, so exp(G) 4 4 by example 3.4(ii). But then exp(G) = 2, so 
GZ. x Zs by Proposition 14. O 


This result is typical of what we have in mind when we are classifying 
groups. Nevertheless we must be careful not to overlook hidden isomor- 
phisms. 


PROPOSITION 17. If H and K are cyclic groups of respective orders m and 
n which are relatively prime, then H x K & (Zmn,+). 


Proof. In view of Theorem 2 we need only prove (Zm,+) X (Zn, +) is cyclic 
of order mn. But ([1], [1]) obviously is an element of order mn, as desired. 0 


For example, Z3 X Z2 & Ze. 

The direct product of an arbitrary number of groups is defined to be 
the Cartesian product with the operation defined componentwise. This 
construction can provide groups that are generated only by large numbers 
of elements, cf. Exercise 2, thereby answering the question posed earlier. 


Remark 18. Gy x ...x Gi & (Gi x... x Gt_1) x G;, under the isomorphism 
(915-594)  ((o1.--- 594-1), 94). [Ga x... X G;| = |G]... |G], by induction 
on t. 


The direct product of monoids gives information about groups of units 
and has a cute application to number theory, as seen in Exercises 5ff. 


Exercises 


Li 


Any group of exponent 2 and order 2™ is isomorphic to Z)x---x Ze 
(taken m times). 


. Zo X Ly X +++ X Zo (taken m times) cannot be generated by fewer 


than m elements. 


. Hx K » K x H under the isomorphism (h,k) 6 (k,h). 


4. (Gy x G2) x G3 ie) Gi x (Gy x G3). 


. If M, and M2 are monoids, then 


Unit (MM, x Mz) ~~] Unit (M,) x Unit (M,). 


(Hint: Invertibility is by components.) 


. If m and n are relatively prime then, (Zimn,:) © (Zm,+) X (Zn,:) as 


monoids. (Hint: [a]nn 0 ([a]m:[a]n) defines a 1:1 map.) 


. If m,n are relatively prime then the Euler number y(mn) equals 


p(m)p(n), by Exercises 5,6. Using Exercise 2.9, conclude that if 


pi,---,pt are the distinct prime numbers dividing m then 
1 1 
m)=m(1l—-—)...(1-— 
(mm) = m(1— )...(1- =) 


. If Gis an internal direct product of H and K and if A, B are normal 


subgroups of H, K respectively, then G/(A x B) =» H/A x K/B, 
viewed in the natural way as an internal direct product. (Hint: 
Noether I.) 


. (Direct product cancellation). If G x H, =» G x Hy» with |G| finite 


then H, ~ Hy. (Eatended hint: It is the same to prove that if a 
group K can be written as an internal direct product G, x Hy, = 
G2 x Hy» for subgroups G; and A;, 1 = 1,2, and if Gy = G», then 
A, » Hp. If HW, Gy = {e} then H, » K/G, = Hy so one is done. 
Thus one can put A = A, NG, 4 {e} and B= H,NG, F {e}. Use 
Exercise 8 repeatedly: 


G2/Ax H,/Bx G/AB=G/BAxG/Bx H\/A (1) 
as internal direct products, so 


Gy X Go/A x H,/Bx Go x Gi/Bx H,/A (2) 
a G/B x Ay x G2/A. 


Analogously 


G, x G/B x H,/Ax Go/Ax Hy x G/B. (3) 


But (1) together with G, * G2 shows the left hand sides of (2) 
and (3) are isomorphic, so the right hand sides are isomorphic; by 
induction cancel G/B and then G2/A.) 

10. There is an injection S;, x S, > Soy, given by 


Lawes 90 ntl wee. “240 
(9, T) > e won (n+l)... a) d 


WEEK 7. FINITE ABELIAN GROUPS 


Abelian groups have a much more manageable structure than groups in 
general. We shall see soon that Abelian groups are not much more compli- 
cated than cyclic groups, but first let us illustrate their nice structure by 
means of some easy facts. 


Remark 0. For any Abelian group A and any positive number m, define 
A(m) = {a € A: a™ =e}. Then A(m) < A. (Indeed if a” =e and b™ =e 


then (ab-')™ =a™(b™) | =ee | =e.) 
Remark 0 fails for non-Abelian groups — S3 has four elements of order 
dividing 2, so this set is too large to be a subgroup (by Lagrange’s theorem). 


Remark 1. For any Abelian group A and any m > 0, there is a homomor- 
phism f:A—> A given by a a™, and ker f = A(m). 
We shall also need Cauchy’s theorem, proved in general in Theorem 
3.1, but the Abelian case needed here is an easy exercise, cf. Exercise 1. 
Our present goal is to characterize finite Abelian groups. We shall 
do this in terms of generators. Like most general notions of group theory, 
generators become much easier in the context of Abelian groups; a set 


{a1,...,a} generates an Abelian group A iff every element of A can be 
put in the form a{'... a; for suitable 21,...,7 in Z; if A is finite then we 
can take 21,...,74 in N. 


Obviously for any ¢t the direct product C, x --- x C; of cyclic groups 
Cy, = (a1),...,C; = (az) is Abelian, generated by the ¢ elements 


(a1, €,€,...,€), (€,02,€,...,€),...,(€,€...,€, a4). 
Our main object is the following amazing converse, foreshadowed by Exer- 
cise 6.1: 


THEOREM 2 (FUNDAMENTAL THEOREM OF FINITE ABELIAN GROUPS). 
Every finite Abelian group generated by t elements is isomorphic to a direct 
product of t cyclic subgroups. 


Of course one could throw in redundant generators (such as e), and 
on the other hand one could tack on direct products with copies of {e}, 
since any group G & G x {e}, and it will be convenient to permit these 
redundancies. 

We shall see that this decomposition is unique, up to isomorphism and 
permutation of the cyclic components, and insertion or deletion of copies 
of {e}. This result classifies all the finite Abelian groups, since we have 
already classified the cyclic groups (Theorem 6.2). However, the theorem 
presents a pedagogical dilemma, since actually a much stronger result is 
true: 
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THEOREM 2! (FUNDAMENTAL THEOREM OF FINITELY GENERATED 
ABELIAN GROUPS). Every Abelian group generated by t elements is iso- 
morphic to a direct product of t cyclic subgroups. 


One can prove Theorem 2’ directly and elegantly with the tools we have 
in hand, but the proof is very intricate; we follow a more straightforward 
approach due to C.R. MacCluer, which is more apt for finite groups and 
gives a strong version of Theorem 2. The proof of Theorem 2’ is sketched in 
Exercise 7, (also cf. Exercises 8ff.) in the hope that its idea of proof might 
actually become clearer without the distraction of certain details. Also see 
Exercises 14-17 for more information concerning infinite Abelian groups. 

Some of the methods used in our proof are applicable to arbitrary 
groups, and others are suited best to Abelian groups; to keep the threads 
straight we shall let G denote an arbitrary group, and let A denote an 
Abelian group. 

In Chapter 6 we showed that a group was isomorphic to a direct prod- 
uct of two subgroups iff it was an “internal direct product,” 1.e., iff the two 
subgroups satisfied properties DP1, DP2, and DP3 of proposition 6.11. 


Definition 3. A group G is called an internal direct product of t subgroups 
A,,..., H, if the following properties are satisfied: 

(DP1’) If hy...h: =e for h; in H;, then each h; = e. 

(DP2') H,H,...H:;=G; 

(DP3') hyhj = hjh; for any i # 7, h; € Hj, and h; € Hj. 


The reader should check that DP1’, DP2', and DP3’ coincide with 
DP1, DP2, and DP3 respectively, when t = 2. 


Remark 3'. If G is the internal direct product of H and K, and if K is the 
internal direct product of Ky,..., Ay, then G is the internal direct product 
of H, K,,..., Ky. (Indeed, DP2' and DP3’ are clear. To see DP1' suppose 
hky...k, =e; then ky... = hol EC HANK =e, 80 ky =--- = hy =e.) 


PROPOSITION 4. 

(i) Given groups H,,..., H+, let G = H, x---x H;, and define y;: H; > 
G by v,(h) = (e,...,e,h,e,...,e), where h appears in the i-th position. 
Clearly ~p; is an injection, and G is an internal direct product of its sub- 
groups H; = ;(H;). 

(ii) Conversely, if G is an internal direct product of subgroups H,,..., H:, 
then Gs H, x--- x Hy. 


Proof. 
(i) For h; in H; write hy = 
(hy,..., ht) in G we see (hy,...,h+ 


pilhi) = (e,...,e,hi,e,...,e). For all 
)=hy...hy € Ay...Hy, proving DP2’; 


if e = hy...hy = (hy,...,h4), then each h; = e, proving DP1’. DP3’ is 


(ii) Define ¢: Hy x --- x Hy 3 G by f((ia,...,ht)) = ha... he. b is a 
homomorphism since (by DP3’) 


= P((Pa.--he))b((hy 4), 


and ¢ is onto by (1); finally (fi,...,ht) € ker iff hy... ht = e, iff each 
h; =e, proving ¢ is 1:1. 0 


Abelian p-Groups 
Since we are considering Abelian groups here, DP3’ will be obvious. 
Let us start with a very important special case. 


Definition 5. For p a prime number, we define a p-group to be a group 
whose order is a power of p. 


Remark &'. If pis the only prime dividing exp(G), then G is a p-group, by 
Cauchy’s theorem. 


By Lagrange’s theorem any subgroup of a p-group is a p-group, and 
likewise any homomorphic image of a p-group is a p-group. Our strategy 
is to prove Theorem 2 first for p-groups and then find a direct product 
decomposition into p-groups, which enables us to write every finite Abelian 
group as a direct product of cyclic p-groups (albeit the number of factors 
may be considerably larger than the number of generators t); finally we 
shall show that some of the cyclic factors can be combined into larger cyclic 
groups, yielding a direct product of ¢ cyclic subgroups. 

We say a subgroup H of G is a direct summand of G if G is the 
internal direct product of H and a suitable subgroup K of G; K is called a 
complement of H. Recall by Example 3.4(ii) that for any p-group G there is 
an element hin G such that o(h) = exp(G). The next result is the key step 
in our treatment, since it permits one to start peeling off direct summands 
from A (thereby paving the way for induction). 


THEOREM 6. Suppose A is an Abelian p-group, and h € A such that 
o(h) = exp(A). Then (h) is a direct summand of A. 


Proof. Induction on n, where |A| = p”. Clearly exp(A) = p™ for some 
m <n. We are done if m =n, i.e., if Ais cyclic (for then A = (h) x {e}); 
thus we may assume n > m. Let H = (h). 

We claim A\ H has an element a of order p. Indeed, by Cauchy’s 
theorem the residue group A/H has an clement b of order p:; its preimage 


bin A satisfies b? € H, so b? = h4 for some gq. If pf q then (h2) = (h), so 
o(h?) = o(h) = exp(A), implying 0(b) = pexp(A), contradiction; thus plq. 
Writing q = pt shows b? = (h*)P; hence a = bh~* has order p and a ¢ H as 
desired, since b ¢ H. 

Let L = (a), a subgroup of order p. Then LM H = {e} since its order 
properly divides p. Let~ denote the canonical image in A = A/L. Then 
H & HT by Corollary 5.19. Hence p™ = o(h) = o(h) divides exp(A), which 
in turn divides exp(A) = p™. Consequently, exp(A) = p™ = o(h), and thus 
by induction H is a direct summand of A. Let K be a complement of H in 
A. Then HK = A and HN K = {e}. Let K be the preimage of K in A. 

Given a in A we have h~‘a € K for suitable i; then letting k = h-‘a 
we see a = h’k and k € K, proving A = HK. Furthermore, HN K = 
Hn(HnN Kk) C HNL= {fe}, implying A is the internal direct product of 
AH and K.O 


We can find A explicitly by using any set of generators of A. 


COROLLARY 7. Suppose A is an Abelian p-group gencrated by {a1,...,a+}, 
and o(a;) is maximal among 0(a@,),...,0(a:). Then exp(A) = o(a;); thus 
(a;) is a direct summand of A. 


Proof. Let o(a;) = m. Clearly m|exp(A); on the other hand o(a,;)|m for 
each j, implying exp(A) = m by Remark 6.8(vi). Hence (a;) is a direct 
summand of A, by Theorem 6. 0 


We are ready for a strong version of Theorem 2, for p-groups. 


THEOREM 8. Every finite Abelian p-group generated by elements a,,..., G+ 
is an internal direct product of t cyclic subgroups, one of which may be 
taken to be (a;), where o(a;) is maximal among o(a,),...,0(a+). 


Proof. For convenience, take i = 1. Then (a1) is a direct summand of A by 
Corollary 7; take a complement K in A, and let yg: A > K be the projection 
onto K (given by ay“k 1 k). Then g(a) = e, so plag),..., (az) gener- 
ate K. By induction (on t), K is an internal direct product of cyclic p-groups 
C,,...,Cz-1, so A is the internal direct product of (a1),C1,....,Cr—1, by 
Remark 3’. 0 


Aside 9. Following the proof of Theorem 8, we see that each cyclic direct 
summand C; of A actually is obtained as a projection of one of the (a;). 


Now let us make the reduction to p-groups. First we need a way of 
locating p-groups inside a given group. 


Definition 10. A subgroup H of a group G is a p-Sylow subgroup of G if 
|H| = p* where p**1 { |G. 


Of course, if H < G and |H| = p*, then p*||G| by Lagrange’s theorem: 
since p*+1 4 |G|, we conclude that a p-Sylow subgroup cannot be properly 
contained in any p-subgroup of G. 


THEOREM 11 (SYLOW’S THEOREM FOR ABELIAN GROUPS). Suppose A is 
a finite Abelian group, and p is a given prime number dividing |A|. Then 
(i) A has a p-Sylow subgroup S; 
(ii) S={a€ A: o(a) is a power of p}. S contains every p-subgroup 
of A, and thus is the unique p-Sylow subgroup of A. 


Proof. Write |A| = pq for q prime to p. 

(i) A contains a nontrivial subgroup N of order p, by Cauchy’s theorem. 
N <A since A is Abelian, and |A/N| = oe = p*~1q. By induction on |Al, 
we see A/N contains a p-Sylow subgroup which by Remark 5.19 has the 
form S/N for suitable S < A; then |S/N|= p*~! so || = p*. 

(ii) Let p* be the largest power of p that divides |G]. Then A(p") < A, 
by Remark 5’, and is a p-group, by Remark 5’. If AH is any p-subgroup 
and h € H, then o(h) is a p-power so h € A(p*), proving H C A(p*). In 
particular A(p”) is the unique p-Sylow subgroup of A. 0 


Actually, (i) also has an easy proof, cf. Exercise 1(ii). One could also 
prove the uniqueness part of (ii) via Proposition 11.6. 


COROLLARY 12. Any finite Abelian group is the internal direct product of 
its p-Sylow subgroups. 


Proof. Let py,....pu be the distinct primes dividing |A|. Let H; denote 
the p;-Sylow subgroup of A, and let Ag = Hy H2...Hy—-1. By Corollary 3.14, 
|Ao| divides |y||H||...|Hu—il; hence H; is also the p-Sylow subgroup of 
Ao, for 1 <7< u-—1. By induction on wu, Ao is the internal direct product 
of H,,...,Hy—1. Furthermore, |Ao| and |H,,| are relatively prime, so, by 
Remark 6.12(ii), A is the internal direct product of Ag and H,,, and we are 
done by Remark 3’. 0 


Before proving Theorem 2, let us remark that Corollary 12 can be 
applied directly, to reduce many questions about Abelian groups to Abelian 
p-groups, cf. the next result as well as Exercise 4. 


Remark 12'. Suppose A is Abelian of exponent m. If |A] = m, then A is 
cyclic. In general, A has some element of order m. (Indeed, write m as a 
product py! .. .p' of prime powers. Then each Sylow pj-subgroup has some 


element a; of order P; by Example 3.4(ii), so take a = a1 ...dy.) 


Proof of the Fundamental Theorem for Finite Abelian Groups 


Proof of Theorem 2. Write an Abelian group A as the internal direct prod- 
uct of its p-Sylow subgroups H,,...,H,. For each 7 < wu we have the 
projection ~;:A > H,; given by oj (hi... hu) = hj. If ay,...,a¢ generate 
A then i;(a1),...,4;(az) generate H;, so by Theorem 8 we can write H; 
as a direct product of cyclic p;-groups Cj1....,Cjz. (If Hj is a direct prod- 
uct of fewer cyclics, just throw in several copies of the trivial group.) But 
letting C; = Ci; x --- X Cui, we then see (by rearranging the Cj;) that 
AwXC, x--- x Cr, and each C; is cyclic by Proposition 6.17. 0 


The Classification of Finite Abelian Groups 
Our results provide a procedure for describing a finite Abelian group A 
generated by t elements as a direct product of cyclic p-groups (where p 
ranges over the prime divisors of |A]): 
(i) Write A as the internal direct product of its p-Sylow subgroups, 
for those prime numbers p dividing |A| (taken in decreasing order), and 
(ii) Write each of these p-groups as an internal direct product of 
cyclic p-groups, taking in decreasing order. 

To conclude the proof of Theorem 2 we merely recombined the (rela- 
tively prime) cyclic factors. Knowing this, it often is convenient to forego 
this step. 

The reader should observe that since each p-Sylow subgroup is unique, 
we have uniqueness in step (i). Although step (ii) is not unique, we do have 


* which we obtain by means of the following 


uniqueness “up to isomorphism,’ 
result. (Actually a more general assertion has been given in Exercise 6.9, 


but the proof for the Abelian case is more intuitive.) 


THEOREM 13 (“CANCELLATION”). If finite Abelian groups G,H, and K 
satisfy Gx Hx Gx K,then H&K. 


Proof. First writing G as a direct product G, x --- x G; of cyclic p-groups 
(for varying p), we have G, x --: x G; x H&G, x --- x G: x K; peeling 
off G1,G2,... one at a time, we see it is enough to prove the result when 
G is a cyclic p-group. 

Write H,,K, for the respective q-Sylow subgroups of H and K, for 
an arbitrary prime number q. When g # p we sce (via the order) that 
{ce} x H, (resp. {e} x Kj) is also the q-Sylow subgroup of G x H (resp. 
Gx K): but Gx Hz Gx K, implying H, = K, for q # p. By Corollary 
12, it remains to show H, = K,. But G x H, » G x Ky, since these are 
the respective p-Sylow subgroups of G x H =» G x K; thus we may replace 
H,K by H,, Ky. In particular we may assume G,H,K are p-groups, with 


G cyclic. Note |H| = cs = = ||. We proceed inductively on 
|G|, noting the result is trivial if |G] = 1. 

For any finite Abelian group A, define f: A> A by a a?; then f is 
a homomorphism by Remark 1, whose kernel is nontrivial iff p divides | A]. 


In particular, |f(G)| < |G]. But 
f(G) x f(T) & f(G x H) & f(G x K) & f(G) x f(K), 


implying by induction that f(H) ~ f(A). Thus it remains to show that we 
can reconstruct H from |H| and f(). This result has independent value, 
so we state it separately, in a slightly stronger version. 


LEMMA 14. (Notation as above.) Any finite Abelian p-group A is deter- 
mined up to isomorphism by |A| and f(A). 


Proof. We write A = C, x... x Cy, where the C; are all cyclic groups 
and Is] DE ise IC > p. Then ker f & Ci(p) x +--+ x Ci(p), which has 
order p*. so pt = = 7oh: , thereby determining t. Also, taking wu < ¢ maximal 
such that |C,| > p, we have f(C;) = {e} for each i > u. Hence f(A) & 
f(Ci) x---x f#(C,,), and, by induction on |A] (since | f(A)| < |A]), the f(C;) 
are uniquely determined up to isomorphism. Thus it remains to show for 
each i that f(C;) determines C;. But if |f(C;)| = p™ then |C;| = p™t! 
implying C; & Z,m+1. 0 


In this proof we have encountered a new group invariant, the number of 
elements a satisfying a? = e. Lemma 14 also yields at once the uniqueness 
of the components in the fundamental theorem of finite Abelian groups: 


THEOREM 15. Any (nontrivial) Abelian p-group A is decomposable uniquely 
(up to isomorphism) as a direct product of cyclic p-groups C, x ++: x Ch, 
where |C\| >--- > |C;| > 1. 


Example 16. We shall now determine all Abelian groups of order 144, up 
to isomorphism. If |A| = 144, then A =» H. x H3; where H,, denotes the 
p-Sylow subgroup; i.e., |H2| = 16 and |H3| = 9. Hence H» could be 


Z16, Ze x Zia, La x La, La x Zo x Zia, or Zo x Lo x Zo x Za, 


which are nonisomorphic; H3 could be Zg or Z3 x Z3. Therefore, A is 
isomorphic to one of the following ten groups: 


Zig X Zo, Zig X LH X Ly, La X La X Log, La X La X Le X Zo, 


Lig X Le X Ln X Ly X Ho, Zig X £3 X Zs, Zig X Le X 3 X Zs, 


Lax laX 3 XL, LaX LyX lig X13 XL, or Ly X Ly X Le X Le X H3 XZ 3. 


Their respective exponents are 144, 72, 36, 36, 18, 48, 24, 12, 12, 6. Note 
that eight of these have different exponents and thus are nonisomorphic by 
inspection. The only pairs having the same exponent are Z4 x Z4 Xx Zo and 
La X Le X Ly X Ly, and Z4 X Za X Z3 X Zz and Z4 X Zo X Ze X Z3 X Zs. 
But canceling Z,4 from each, we now have groups of differing exponent (36, 
18, 12, 6), which are thus nonisomorphic. 


In general, suppose A is an Abelian group of order p”. Writing A as a 
direct product of cyclic groups Cy x +--+ x Cy, where |Ci| > |Cy| >..., and 
letting |C;| = p”*, we have 
i" pm = pear LIne 
implying n = mz + m2 +---+m,. Thus the number of nonisomorphic 
Abelian groups of order p” is precisely the number of ways we can write 
m= my +m. +--+ +m, with my > m2. > --- > m: this is called the 
“partition number,” as explained in Exercise 13 and Exercises 16.16 ff. 


Exercises 


1. (i) Proof of Cauchy’s theorem for any Abelian group A. Induction 
on n = |A|. Take any clement a # ec, and let m = o(a). If p|m then 
o(a™/?) = p. If ptm then by induction A/(a) has an element 6 of 
order p, so o(b) = pt where ¢ divides m, and o(b*) = p. 

(ii) More generally, prove that any Abelian group A has a p-Sylow 
subgroup, for any p dividing |A|. (Hint: Apply induction to A/{a), 
where o(a) =p.) 

2. (i) An Abelian p-group is cyclic iff it has only p — 1 elements of 
order p. 

(ii) More generally, if A # Zy, X Zn, X +++ X Zp, then the number 
of elements in A of order dividing m is (m,n1)...(m,nz). (Hint: 
Reduce to the case A is cyclic.) 

(iii) If G and H are nonisomorphic Abelian groups, then there exists 
some m such that G has a different number of elements of order m 
than H. (Hint: Pass to p-Sylow subgroups.) This fact does not 
hold for non-Abelian groups, cf. Exercise 10.9. 

3. If a is an element of maximal order in a finite Abelian group A, 
then (a) is a direct summand of A. 

4. Any Abelian subgroup of S,, is isomorphic to (C1) x --- x (C+), 
where C),...,C; are disjoint cycles in S,. (This is not so easy; cf. 
Exercise 5.9.) 


5. Using Exercise 4, show that any Abelian subgroup of S, has order 
m = mMmy4...m, where m, +---+m: <n. Show that m is maximal in 
the following situations: 

(i) If n = 3k then m = 3"; 
(ii) Ifn = 3k4+ 1 then m = 4-3*-1; 
(iii) If n = 3k +2 then m= 2-3". 
Thus @ + oo as n > oo. 
6. Determine the analogues of Exercises 4 and 5 for A, in place of S,,. 


Finitely Generated Abelian Groups 

7. In this exercise we outline the proof of Theorem 2’, leaving the 
reader to fill in the details. To make the idea more intuitive we 
write the operation of A as addition instead of multiplication; 7.e., 
the neutral element is now 0. One wants to prove that A is an 
internal direct product of (a1),..., (a+) for suitable a; in A. The 
condition DP1’ now is: If 5+ n,a; = 0 for n; in Z then each nja; = 0. 
Accordingly we shall say a,,...,a4 satisfy a nontrivial dependence 
relation if Sn ;a; = 0 for suitable n,,...,n, in Z such that some 
{Ay ea 0. 

Take ¢ minimal such that A is generated by ¢t elements. We 
are done unless each generating set {a,,...,a4} satisfies a nontriv- 
ial dependence relation 5> n;a; = 0; by symmetry (7.e., renum- 
bering the a; if necessary) assume nia, 4 0. Of all generating sets 
{a,,...,a+} and all dependence relations S> n;a; = 0 with nya, #0, 
choose {a1,...,a+} and the dependence, such that |n,| is minimal. 
One may assume ny > 0. Note that |n;| > ny for any 7 such 
that nja; #0. By creating new dependence relations with smaller 
values of n1 prove the following reductions (in increasing order of 
difficulty): 

Reduction 1. If ma, =0 with m > 0, then m > n,. 

Reduction 2. If 3> m,a; = 0 is an arbitrary dependence relation 
(involving the same a1,...,a), then ny|my. 

Reduction 3. One may assume n4|n,; for all 7 (seen by replacing 
a, by a, + 4;, and applying induction to n;). 

Write n; = n1q; for all 1 <i < t. Define a) = a1 + 0;,, gia: 
then a),ao,...,a44 generate A. Furthermore n,a = 0. Let AH be 
the subgroup of A generated by az,...,a4. Then A & (ai) x H; 
conclude by induction on ¢. 

There is a way of bypassing these reductions, which casts light 
on generating sets, as we see in Exercises 8 through 11. Write M,(Z) 
for the t x t matrices with entries in Z. 


8. 
9. 


10. 


11: 


12; 
13. 


14. 


15. 


16. 


17. 


A matrix in M;,(Z) is invertible if and only if its determinant is +1. 


If m,,..., mz are relatively prime integers, then there is an invertible 
matrix in M;(Z) in which my, ..., m+ constitutes the first row. (Hint: 
Induction on ¢. Let d= gcd(my1,...,m+-1) and write m; = dn; for 
1<i<t-—1. Let Q be an invertible #-— 1x #—-—1 matrix with first 
row n4,...,M+~1, and let Q! be the ¢-— 2 x t—1 matrix consisting 
of all but the first row of Q. Then take 

Mp ve Me Me 

wy we UME 

Q= ay 
Q! 


where mv — ud = 1.) 

Suppose an Abelian group A is generated by aj, ...,a:, and let Q be 
any invertible matrix in M,(Z). Let (61, ..., 64) = (@1,...,a4)Q. Then 
b,....,b; also generate A. (Hint: (a4,...,a4) = (b1,....6:)Q-*.) 

If an Abelian group A is generated by ay, ..., a;, and if m1, ..., m+ are 
relatively prime integers, then there is another set of ¢ generators 
that includes S7*_, maj. (Hint: Use Exercises 9 and 10.) 

Bypass reductions 1,2,3 in Exercise 7, by means of Exercise 11. 
Define the partition number P; to be the number of ways one can 
write ¢ as a sum of positive integers tj + tg +---+¢, , such that 
ty >t, >--: > ty. For example, P; = 3, Py = 5. Let #(n) denote 
the number of Abelian groups of order n. Then #(p") = P,, for any 
prime number p, and in general if n = pi ... pi with the p; distinct 
primes, then #(n) = Pi, ... Pt. (Compare with Example 16.) The 
partition number is computed in Exercise 16.16ff. 

Define the torsion subgroup t(A) = {a € A: a has finite order}. 
Thus a € ¢(A) iff a“ = e for suitable u > 0. A is called torsion-free 
if t(A) = {e}. Show that t(A) < A, and A/t(A) is torsion-free. 
Any finitely generated torsion-free Abelian group A is isomorphic 
to a direct product of m copies of Z, for suitable m, and m is 
uniquely determined (called the rank of A). (Hint: To show that m 
is uniquely determined, it suffices to show that if m <n and there 
are n Xm and m xn matrices (over Z) whose product is the identity 
n Xn matrix, then n = m. But this follows from Exercise 8.) 

Any finitely generated Abelian group A is the internal direct prod- 
uct of ¢(A) and a torsion-free Abelian group that is unique up to 
isomorphism. 

Any finitely generated Abelian group is isomorphic to a finite direct 
product of copies of Z and cyclic p-groups (for suitable distinct 


prime numbers p.) This decomposition is unique, up to isomorphism 
and permutation of the cyclic factors. 


CHAPTER 8. GENERATORS AND RELATIONS 


In contrast to the Abelian case, generation in arbitrary groups is very com- 
plicated; we need some other ingredient to describe a group, namely the 
relations among the generators, 1.e., those expressions among the generators 
that are equal. 


Example 0. Suppose A is any finite Abelian group. By the Fundamental 
Theorem, A is an internal direct product of subgroups (a1),..., (a+). Letting 
nj; = o(a;) we see that any element of A can be written uniquely in the form 
ay! ...a¢', where 0 < a; < nj; for each 2. (We shall say an element described 
this way is in “normal form.”) In particular, A is generated by elements 


a,,...,@4 which satisfy the relations 
(i) a:;' =e foreachi, where n; = 0(a,); 
(ii) a,a; = aja; for each 7,7. 


Of course, we also have other relations such as aja7 = as 


consequence of (ii) and associativity. All other possible relations among 


a;, but this is a 


the generators are extraneous. To be sure that our relations determine A, 
we could build the multiplication table of A, using only these relations and 
associativity. Indeed, we check by means of repeated applications of (ii) 
that 


and then we use (i) to reduce the powers of a; modulo n,, thereby arriving 
at an expression in normal form. 


By a presentation of a group G, we mean the description of G in terms 
of generators and enough relations to reconstruct the multiplication table of 
G. Any group G can be presented by means of the generators {a : a € G} 
and obvious relations {ab = c : a,b € Gand ab = c}; however, we are 
interested in finding efficient presentations, in terms of minimal sets of gen- 
erators and relations. Actually we are treading on rather thin ice here, since 
it is not guaranteed that G can be described “effectively” through gener- 
ators and relations, in the sense that there may be no procedure to check 
when two elements are equal! Nevertheless, many groups are described best 
in terms of generators and their relations. The notion of “generating” can 
be described intrinsically (cf. Exercises 18ff), but we shall rely mainly on 
Definition 6.5 and our intuition. 

Generation of a non-Abelian group is often much more subtle than 
generation of an Abelian group; although |S,,| = !, which grows rapidly as 
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n increases, S,, is generated by two elements having respective orders 2 and 
n, cf. Exercise 5.8. Nevertheless, the non-Abelian groups G of immediate 
interest to us will be described explicitly in terms of two generators a,b 
such that G = (a){b), i.e., G = {a'b? :0 <i <u, 0 < 7 < v} for suitable 
u,v (which is impossible for S,, n > 6, ef. Exercise 11). Also these groups 
will be defined by rather few relations. 


Remark 1. Suppose G is generated by {a,b}. 

(i) If a,b commute then G is Abelian. 

(ii) If N < Gand aNa-'=N and bNb-! = N, then N4G. 

(iii) If ba*b~+ € (a*), then (a‘) «G, by (ii). 

(iv) If y:G > HI is an onto group homomorphism then {y(a), y(b)} 
generates H. 


Description of Groups of Low Order 
Let us start by describing all groups of order 6. 


Example 2. Suppose |G| is an arbitrary group of order 6. By Cauchy’s 
theorem, G contains an element a of order 3 and an element b of order 2. 
Then G = (a)(b) by Remark 6.12 (7.e., |(a)(b)| = 2 = 6). The information 
obtained so far is given by the two obvious relations 
a =e; (1) 
PS, (2) 
In order to foster a new meaningful relation, note that (a) has index 2 in G 
and thus is a normal subgroup, by Proposition 5.9. Hence bab~! € (a) = 
{e,a,a7}. But o(bab~!) = o(a) = 3, implying bab-! 4 e. Thus we have 
two possibilities (noting a? = a7?): 
bab-' =a, i.e., ba = ab: (3') 
bab-t =a", i.e, ba=a'b. (3") 
By Example 0, relations (1),(2), and (3’) define Z3 x Zz & Ze, so we 
need only concern ourselves with (3), which implies 
(ab? )(a“b’) = a*b? 1 (ba)a*— 1b” = a’bF hat ba" 1b” 
-_ abi (a T)tpt? = abi tg7 pity 
= a’bi 2 g"h2t¥ oe git(-Diupite. 
Then we can use (1) and (2) to reduce i + (—1)/u to its residue mod 3, 


and 7 + v to its residue mod 2. Since we have constructed the complete 
multiplication table, we have proved 


Remark 8. There are at most two nonisomorphic groups of order 6, each 
generated by two elements; one group is Z3 X Zo, given by relations (1), 


(2), (3'), and the other is given by (1), (2), (3”). 
In order to push this analysis further we need an obvious observation. 


Example 3'. In Example 2, the group corresponding to (3”) is not Abelian 
(since ba = ab # ab). But $3 is non-Abelian. We conclude: 

Relations (1), (2), (3') define Z¢ = Z. x Zs. 

Relations (1), (2), (3) define S53. 


Thus all groups of order 6 are isomorphic to Z, x Z3 or S3. 


A general principle in mathematical discovery is that good ideas go a 
long way. Perhaps we can generalize Example 3” to handle degrees other 
than 6 = 2-3. Note that the reasoning that (1), (2), and (3) determine the 
multiplication of G also yields the following more general observation: 


Remark 4. Suppose G is generated by elements a and b such that (a) <«G. 
(i) G = (a)(b). Indeed, bab~+ € (a), so bab~* = a’ for some i. But then 
ba = a‘b, so that we can always reduce an expression in a and 6 to the form 
a“b” for suitable u,v; namely, we replace any occurrence ba by a*b, thereby 
eventually moving all occurrences of b to the right of all occurrences of a. 


(ii) |(a) 9 (b)| = “22. seen from 


(iii) (a) M1 (by = (al@l/o)) = (plGl/0(4)). Indeed, 


(a)IN()] __o(@o(b) 


OOM 
N= Tay Gy] ~ [ayn GT 


in particular = Se. which divides o(a). Hence, by Corollary 6.4, 
(a) Nb) is the (unique) cyclic subgroup of (a) of order oe = o(al@l/o(®)), 


yielding the first equality; likewise, 


(a) 1 (b) = (b) 9 (a) (BIG), 


Note that when o(a)o(b) = |G], (iii) says (a) (b) = {e}. Often we can 
make do with this easier special case. 
(iv) Any element of G can be written uniquely in the form a‘b! for 


0<i<oa),0<j< A. Indeed, we lower the power of 6 by means 


of (iii), since we can replace b'¢|/°( by a suitable power of a. If a'b? = aé bs 


for 0 < i,7' < o(a) and 0 < 3,7’ < aa then 


or Sy re (a) 0 (by = (pIG/oy, 


so 7 —j' = 0 yielding 7 = 7’; hence, a’ 


ov 
4 


=e,so0i—i'=0, yielding 7 =7'. 


Definition 5. (n arbitrary.) The dihedral group D,, (if it exists) is the group 
generated by two elements, satisfying the relations 


bab a “(Sa *): 


Of course, we have not yet proved that D, is a group of order 2n, but 
we assume it now, deferring the proof until Chapter 9. One can also prove 
this fact by direct verification, or geometrically (Exercise 2), or algebraically 
(Exercise 3). 


Remark 6. (a) <4 .D,, by inspection, so Remark 4 shows that D, = (a)(b); 
also (a) 1 (b) = {e}, and every element of D, can be written uniquely in 
the form a°bh? :0<i<n-1,0<j<1. 


Now we can characterize groups of order 2p, for any odd prime p. 


Example 7. (Generalization of Example 2) If |G] = 2p with p an odd prime, 
then G & Z2 X Zy & Zoy or GX Dy. 

Indeed, by Cauchy’s theorem, G contains an element a of order p and 
an element 6 of order 2. [G : (a)] = 2, so (a) 4G by Proposition 5.9. Since 
p and 2 are relatively prime we see that (a) M (b) = {e}, so the discussion 
above yields G = {a'b? :0 <i <p, 0 <j < 1}. On the other hand, a,b 
satisfy the relations 


Bie: (4) 
bY =e, (5) 
bab~' = a" for suitable pi: i.e., ba = ab. (6) 


As in Example 2, these relations determine the multiplication table and 
thus define the group up to isomorphism; furthermore we can use (5) to 
determine the possibilities for ju: 


a= bl? ab~* = b(bab)~1b~* = ba“b~t = (bab~*)4 = (a#)* = al 


implying a ~! = e, so p = o(a)| p2—1 = (—1)(-+1). Thus p divides p+1 
or 2 — 1, so p = +1 (mod p). Since a? = e, we may take pp = £1. When 


pt = 1 we have bab~! = a, i.e., ba = ab and G is Abclian by Remark 1. 
When js = —1 we have G & Dy. 


Before concluding this line of reasoning, let us describe an even further 
generalization of Example 2, which however requires a result we have not 
yet proved. 


Example 8. Suppose |G| = gp with q < p prime numbers. G contains an 
element a of order p and an clement 6 of order q, and (a)<G by Corollary 5.7. 
Thus G= {a'b?:0<i<p, 0< 47 <q}, and the generators a,b satisfy the 
relations 


oP =e, (7) 
bi’ =e 


bab—! 


II 
a 
= 


for suitable 1<p<p-1. (9) 


As in Example 7, we can limit the possibilities for ys by noting 


a = b%ab—% = bt (bab—1)b--Y) = 


— pt-1gHp-(a-D) — pi 2g p-(4-2) =. gh 
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implying uw? = 1 in Euler(p). Since Euler(p) is cyclic (to be proved in 
Theorem 18.7), the subgroup H = {y: wp? = 1} has order d = (g,p—1), by 
Corollary 6.4 and Example 2.6’, so dis the number of solutions to 4 = 1 
(mod p). Since q is prime, we see 


d=1 if qft(p-—1), in which case 4s = 1 is the only solution; 
d=q if q|(p—1). 


Of course, js = 1 corresponds to the Abelian group Z, X Zp, & Zqyp. Thus, 
there exists another solution precisely when gq|(p — 1), in which case the 
relations (7),(8),(9) define a new group (if it exists), called the semidirect 
product of Z, and Z,. Actually we have (q — 1) possibilities, corresponding 
to the q—1 choices for wp 4 1, but in fact these all correspond to isomorphic 
groups, as indicated by the following argument: 

Suppose G is a group corresponding to a particular relation bab~! = a". 
Replacing 6 by 6! = b? for any 1 < j < q, we note that {a,b'} generates G, 
and 0(b')= q, but now b’a(b')~1 = al”. Clearly, pyp?,...,p4-" are distinct 
mod q since o() = q (in Euler (p)), so each of the different solutions p 4 1 
of (9) provides a different presentation of the same group G. (To formulate 
this result precisely, see the addendum and Exercise 23.) 


In Exercise 6 we obtain a concrete realization of the semidirect product, 
and thereby see that it exists. The semidirect product will be put into a 
much wider context in Exercise 12.26. 


Example 9. Every group of order 15 is isomorphic to Z15 & Z3 x Zs, since 
3 does not divide 5-—1= 4. 


Our final example is the classification of groups of order 8. This will 
display the techniques we have developed until now. 


Example 10. Suppose |G| = 8. If G is Abelian, then G is isomorphic to 
Zs, LX Le, or Ly X Ley X Ze, by the fundamental theorem of finite Abelian 
groups. Thus, we shall assume G is not Abelian; in particular, exp(G) 4 2 
(by Proposition 3.5) and exp(G) # 8 (since otherwise G is cyclic). Hence 
exp(G) = 4. Take a € G of order 4; then (a) has index 2 in G, and thus is 
normal. Take any b € G\{a). Then bab~? € (a). But o(bab~!) = o(a) = 4, 
so bab~! € {a,a°}. Furthermore, |(a)(b)| is a power of 2 greater than 4, and 
so equals 8 = |G, i.e., G = (a){b). Since G is non-Abelian, bab~1 ¥ a, so 
we conclude bab~! = a°. Thus far we know the generators a and b satisfy 
4 


=e and bab—! = a°, and next we want to determine the 


the relations a 5 


order of b. Clearly, there are two possibilities: 2 and 4. If o(6) = 2 then 
G = Dg, so we consider the remaining choice, o(b) = 4. By Remark 4 (iii), 


{e,a?} = (a) N (b) = {e, b?} and thus a? = b?. 
Let us give this potential new group a name. 


Definition 11. The quaternion group Q is the group of order 8, having 
3 


generators a,b satisfying the relations a* = b+ = e, a? = b?, and bab! = a?. 

It is easy to build the multiplication table from these relations; Q is 
indeed a group (Exercise 9), and Q = {a'b? :0<i< 4, 0<j< 1} by 
Remark 4(iv), but we must wait until appendix B to see how Q derives its 
name. D4 and Q both have exponent 4 and share many other properties in 
common. However, the number of elements of order 2 differs (cf. Exercises 
1 and 8). We summarize our findings as 


Remark 12. Any group of order 8 is isomorphic to one of the following 
(nonisomorphic) groups: Zs, Z4 x Zo, Zo xX Zo xX Zo, Da, or Q. 


Combining Example 2 and Remark 12 with Corollary 6.16, we have 
classified all groups of order < 8. Furthermore, we have classified all groups 
of order pq for p # q prime, in Example 8. Thus the only remaining case 
for |G| less than 12 is 9, to be dealt with in Example 9.9. 


Addendum: Erasing Relations 

Our point of view until now has been to start with a given group and 
determine its relations. This treatment bypassed the difficult question of 
exactly what a relation is, and we should like to consider this question, even 
though it removes us from the realm of finite groups. To gain intuition, let 
us see what happens when we start erasing relations. Perhaps the most 
natural thing to erase is the n from Dy. 


Example 13. The infinite dihedral group D is defined as having generators 
a,b satisfying the two relations b? = e and bab-! =a~!. Thus D = fa‘! : 
1E€Z,0<37< 1}. (Note that D is infinite.) As in Example 2 we see 


(a'b)(a%b”) = att OD upit 


implying for any n that the map y:D > D,, sending a‘b! to its value in 
D,, is indeed a homomorphism, for 


o((a'b!)(a%b”)) = plait Y"Hi4) = (aibl )p(a"b”). 


yg is onto, by inspection; ker @ = {a’b) :i =0 (mod n) and j = 0} = (a”). 
Actually, we have not yet proved D is a group. This can be seen directly, 
cf. Exercises 12 and 13, but we can avoid many difficulties by erasing all 
the relations, thereby obtaining a group free of all relations. Ironically, the 
easiest method of constructing this group is by returning to monoids. 


Example 14. The free monoid on aset S = {s1, 82,...} 1s the set of “words” 
(.e., strings of elements of S) including the “blank word” (which has no 
letters in it); the operation is juxtaposition of words, e.g., ($185)(s28184) = 
8185828184, and the neutral element is the blank word. Now take a disjoint 
copy S’ = {s),s,...} of S, and build the free monoid M on SU S'. We 
say two words in M are equivalent if we can obtain one from the other by 
successive insertions and/or deletions of various s;s!, or s‘s;. For example, 
$1 $3598 is equivalent to 815955 85, since first we delete 5383 and then insert 
8585 at the end. Write [h] for the equivalence class of the word h. Clearly, 
the equivalence classes of M form a monoid G(S) (under juxtaposition and 
reduction), which is a group since we obtain the inverse by switching s; and 
sand then reversing the order of the word. For example, the inverse of 
[s18484] is [$4898] since 
[81 854848284] = [81858281] =[s18,] =1 

(writing 1 for the equivalence class of the blank word) and, likewise, 

/ 


[548284818584] = [84828584] = [5454] = 1. 


In particular, [s¢] = [s,]~?. 


Remark 15. G(S) is called the free group on |S| generators and satisfies the 
important property that for any group G and any sect {a1,a@2,...} in G of 
the same cardinality as S, there is a group homomorphism #:G(S) ~ G 
given by [s;] > a;. (See Exercise 15 for the proof.) 


Now suppose {a1,@2,...} is a generating set for G. The relations of 
this set are precisely the elements of ker4), and so our quest for a minimal 
set of relations is simply a search for a minimal set A of elements in the 
free group, for which ker y is the smallest normal subgroup containing A. 

Now we look at the coin from the other side. For any set A of words 
in G(S), let N be the intersection of all normal subgroups of G(S') that 
contain A. Then N<G(S), and G(S)/N is the group defined by generators S 
and relations A, cf. Exercises 21 and 22. Thus any set of generators 
determines a group. 


As important as this construction is, there are several shortcomings. 
Given a set of words A there is no comprehensive method of determining 
whether G(S)/N is a finite group (or even whether G(S)/N is trivial!). 
Indeed, it is impossible to find an algorithm that will always determine 
(for arbitrary A) when two given elements of G(S') have the same image 
in G(S)/N; in technical language, the word problem is undecidable in gen- 
eral. 

Also the structure of G(S) is considerably more complicated than the 
structure of the finite groups that led us to G(S'). Nevertheless G(S) pos- 
sesses certain interesting properties of its own. For example, one important 
theorem outside the scope of these notes is that any subgroup of a free 
group is free (but not on the same number of generators, cf. Exercise 17!). 


Exercises 


1. o(a’b) = 2 in D,, for every i. Conclude that exp(D,) = 2n for 
n odd, and n for n even. 

2. Dy is a group; and |D,,| = 2n. (Hint: Divide the circle into n equal 
parts, starting at the right. Label these points 0,1,...,2—1, and 
define the transformation a that rotates the circle from 0 to 1 (and 
thus from 1 to 2, and so on). Define the transformation 6 that flips 


the circle along the horizontal axis, t.e., 1H n—-—1, 2H n-2, 
and so on). Show bab~! = a7}, 1.e., “flip rotate flip” corresponds 
to rotating the circle in the opposite direction. Thus the group 
generated by these rigid transformations corresponds to Dy.) 

One can show in fact that any finite group of symmetries of 
the circle has a cyclic subgroup of index 2 and thus is isomorphic to 
some D,, or Z,. The same general approach can be used in studying 


13. 


14. 


15. 


the finite subgroups of the symmetries of the sphere. What is the 
group of rigid transformations of the triangular pyramid? of the 
cube? See Weyl’s book, Symmetries, Princeton University Press, 
for a beautiful treatment of symmetry in nature and art. 

Identify D,, with the subgroup of S, generated by the two permu- 
tations (1 n— 1)(2 n—2)(3 n— 3)... and (1 2 3...n). 

Any subgroup of D,, is either of the form Z, Z2, or Dm, where 
m|n. (Hint: Exercise 1.) 

Define the semidirect product of Z, and Z,, in general, for any two 
natural numbers p,q such that q divides y(p). 

Suppose p is a prime number, and q|(p—1). The semidirect product 
of Z, and Z, can be identified with the subgroup of S, generated 
by a=(12...p) andb=C,...C(p-1)/q, where the C; are disjoint 
cycles of length q. (Hint: Take ps satisfying w4 = 1 (mod p), and 
C,= (fj wg pg... wt *7) for 1 < 7 < p—1, throwing away repe- 
titions.) Generalize this to the case p is not prime (cf. Exercise 5). 
Suppose q is a prime number that does not divide n. There is a 
non-Abelian group of order nq having a normal cyclic subgroup of 
order n, iff g divides y(n). 

Every element of Q other than e and a?(= b*) has order 4. Conclude 
that D4 and Q are not isomorphic. 

Q is a group. (Hint: Identify Q with the multiplicative subgroup of 


M2(C) generated by a = € =) and 6 = i i 


. Generalize the quaternion group to define a non-Abelian group of 


order p® and exponent p*, for any prime p. 


. Show that one cannot find o,7 € S4 such that (7)(r) = Sy. (Hint: 


Ko )IK(7)] < 16.) 


. Verify associativity in the infinite dihedral group D, namely that 


((a'b)(aP BT) (aT BT) = (ab) ((a"bT Yah BT). 


Avoid computation in Exercise 12 by noting that one need merely 
check it in D,, for n large enough, but D, is already known to be a 
eroup. (This argument illustrates a general principle in mathemat- 
ical logic, called compactness.) 

Show that the free monoid M(S) is free, insofar as for any monoid 
and any subset {a1,a2,...} of M there is a monoid homomorphism 
M(S) > M given by $;, 8), -++ 4 Gj, Qi, ..-. 

Prove Remark 15. (Hint: In Example 14 take S to bea group; given 
{a;,a2,...} C S, define 6: M(S) > M by 8; 6 a; and si 4 az!, 
‘sis; € kerd, so @ induces a monoid homomorphism 


49°4 


Now s5;s 


16. 
17. 


18. 


19. 
20. 


21. 


22: 


23. 


¢: M(S) > S, and any monoid homomorphism of groups is a group 
homomorphism.) 

The free group on 1 generator is isomorphic to Z. 

The free group on 2 generators contains a subgroup isomorphic 
to the free group on an arbitrary countable number of generators. 
(Hint: Consider {s1s}s1 :i € Z}.) 


Explicit Generation of Groups by Arbitrary Subsets 
Given any subset S$ of G, define the subgroup (S) generated by S, 
by the following inductive procedure: 


( 
(ii) S(i+1)=S(i)Ufgh! :g.h € S(i)}; 
(iii) (S) =Ujs1S(2) (in G). 


(S) < G, for if g,h € (S), then g,h € S(i) for some i, implying 
gh? € S(i +1) C (S). On the other hand every subgroup of G 
containing S contains (S). 

(S) is the subgroup generated by S, in the sense of Definition 6.5. 
If all the generators of a group G commute with each other then 
G is Abelian. (Hint: induction using Exercise 18.) 

If N«G, then G/N satisfies all the relations of G. Conclude that 
G(S) satisfies the fewest relations of any group generated by S. 
Suppose G is a group with generators S' and relations A, and N is 
the intersection of all normal subgroups of G(S) containing A. Show 
there is a surjection G(S)/N — G. If this map is not an isomor- 
phism, then G satisfies an extra relation not satisfied by G(S)/N 
(and thus not “implied” by A ). 

Use Exercise 22 to describe the semidirect product of Z, and Z, in 
the form G(S)/N, and thereby verify formally the assertion made 
in the text that any two groups obtained in this way (for different 
values of js) are isomorphic. 


CHAPTER 9. WHEN IS A GROUP A GROUP? (CAYLEY’S THEOREM) 


We have just constructed several new candidates for groups, e.g., certain 
semidirect products and the quaternion group, and have shown by various 
ad hoe arguments in the exercises that they are indeed groups. Several of 
these proofs involved displaying the proposed group as a subgroup of S, 
(cf. Exercises 8.3 and 8.6). We would like to see why these proofs work, 
and in doing so shall develop a general procedure that is guaranteed to 
work for any group. 

First we must understand what is meant by “displaying as a subgroup.” 
Philosophically, by “subset of S” we really mean a 1:1 map into S. Indeed, 
“2Z C Z” is a lazy way of saying there is an injection 2Z > Z given by 
2n ++ 2n. Along the same lines, suppose A is a set with a given operation - . 
Then, by “displaying H as a subgroup of G,” we mean finding a 1:1 map 
y: H > G such that 

p(hy he) = p(hi)p(ha) 
with y(H) < G; indeed, we shall see that H is a group because we can 
translate all the group axioms from y(ff) to H. Our main result will 
display arbitrary groups of order n as subgroups of Sj. 

Before proceeding, it is worth remarking that many good ideas in alge- 
bra are inspired by the terminology at hand. For example, many functions 
arising naturally between groups are homomorphisms. 

In Remark 1.11 we saw that the left multiplication map 0,:G ~ G 
given by 

fa(g) = ag 
is 1:1 and onto, for any fixed element a of G. Although £, itself is not a 
homomorphism, we do have the group A(G) of 1:1 correspondences from G 
onto itself, cf. Example 1.5(5), and we have a function g:G > A(G) given 
by y(a) = £,. The key idea is that y is a homomorphism. 


THEOREM 1. (Cayley’s Theorem.) Every group G is isomorphic to a sub- 
group of A(G). In particular, every finite group of order n is isomorphic to 
a subgroup of Sy. 


Proof. Since A(G) is identified with S,, when |G| =n, the second assertion 
follows from the first; thus, it is enough »:G — A(G) given by ah &, isa 
group injection. To check f,, = fl) we note 

larg = abg = a(bg) = ang = lalsg 
for all gin G. Thus, yg is a homomorphism. Finally, a € ker ¢ iff ¢, is the 


identity, implying e = ¢,e = a, thereby proving ker = fe}. 0 


Let us refine this argument, to check when a finite set G is a group 
under a given binary operation. We need to verify associativity and the 


Typeset by A,yS-TEX 


66 


existence of the neutral element and inverses. Once we have candidates 
for the neutral element and inverses, these are very easy to check by direct 
computation. On the other hand, associativity is difficult to verify directly, 


since it involves n3 


verifications, where n = |G]. 

At this point, we find ourselves in the rather interesting situation that, 
although Theorem 1 itself is inapplicable (since G is not yet known to be a 
group), its statement and proof provide the guidance we need. If we knew 
G is a group we would have the natural injection G > S', given by a &,; 
so why not exploit this map directly? 

Suppose, more generally, we can succeed in finding some 1:1 function g 
from G to a group G, which preserves the operation, 7.e., g(ab) = y(a)y(b). 
Then associativity in G is instantaneous, since 


p((abe) = plab)e(c) = (p(ae(b) elo = ela) (e)y(c)) = plalbe)) 


Likewise if m = o(y(a)), then y(ag) = v(g) = y(ga™) for all g in G, 
implying a™ is the neutral element of G, and thus a~! = a™~!, so we have 


proved G is a group. Explicitly, we state 


THEOREM 2. Suppose G is a finite set with a binary operation, and the 
map y:G > A(G) (given by a +> ¢,) is 1:1 and preserves the operation. 
Then G is a group, and 9 is a group injection. 


Remark 2'. If a € G and m = o(a), then for any g in G the elements 


g, ag, ..., a”~'g are distinct; consequently the permutation corresponding 
to £, is a product of II disjoint cycles, each of length m. 


Example 3. The quaternion group Q. Recall Q has generators a and b 
such that at = b* = e, bab~t = a®, and a? = 07. Label the elements 
e,a,a°,a>,b,ab,a7b,a°b of Q respectively by 1,2,3,4,5.6,7.8. To deter- 
mine which permutations correspond to £, and & we construct part of the 
multiplication table 
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The map é, sends eH a, aH a?, a 6 a3, a He, bb ab, and so on. 
Translating this to the labels of the elements we see £, © (1234)(5678) and 
likewise 4, © (1537)(2846) in S,. Thus, letting o = (1234)(5678) and 
T = (1537)(2846) in S,,, we define p:Q > S, by p(a'’b!) = o'7J. To apply 
Theorem 2 we must verify that 0,7 satisfy the same relations as a,b, so that 
the multiplication tables are the same (and thus ¢ preserves multiplication). 


o(o) = o(r) = 4 since o,7 are each products of disjoint cycles of 
length 4. 0? = (13)(24)(57)(68) = (13)(57)(24)(68) = 7?, and Remark 
5.20 yields 

tor | = (5876)(3214) =o *. 


Finally, we need to show that is 1:1. So suppose p(a'bs) = y(a* bi) 
for 0 < a,2 Sead 0 Se <1. Then o*~ 7s ~F € (a) N (tr) = (7), 
so 2\(j’ — 7) implying 7 = 7’, and thus z= 7’. 


Note. Of course, one would want a quicker, more intuitive way of determin- 
ing the permutations o and 7 corresponding to a and b. The idea is to use 
Remark 5.20 judiciously. In view of Remark 2’, ¢ is a product of two disjoint 
cycles of length 4. Since we may replace o by any conjugate (if we replace 
tr by its corresponding conjugate), we may assume that ¢ = (1234)(5678). 
Then 7? = 07 = (13)(57)(24)(68), so 7 = (1 ay 3 a2)(by 1 by co), where 
{a1, 02}, {b1,b2}, and {e1,¢2} are {2,4}, {5,7}, and {6,8} (but not nec- 
essarily in that order). But also ra7~1 = 0! = (8765)(4321), which is 
impossible if {a1, a2} = {2,4}. Thus, we may assume 7 = (1? 3?)(2? 4 ?). 
Noting (8765)(4321) = (5876)(3214), we can apply Remark 5.20 and take 
T= é PE SN ) = (1537)(2846). 


58763214 
Let us try a similar approach for Dy. Now we want 7? = (1) and 
tot | =o | = (8765)(4321), so by inspection we take 


T= (; aeons *) = (18)(27)(36)(45). 


Generalized Cayley’s Theorem 

We could apply our method at once to other groups, but would like first 
to make a slight adjustment to make the method more powerful, and inject 
a group G into S,, for suitable n < |G]. Take any H < G. Then £, acts on 
the left cosets {gH :g € G} , by fu(gH) = agH. Let S = {gH : g € G}, 
whose order is [G : H]. Identifying A(S) with 5; where k = [G : H], we 


have 


THEOREM 4. Suppose [G : H] = k. Then there is a homomorphism 
y:G— S; given bya é,, and kery C H. More precisely, a € ker y 
iff every conjugate of a is in H. 


Proof. (Asin Theorem 1.) Each ¢, permutes the left cosets of H, and 
langH = (ab)gH = a(b(gH)) = talogH, 


proving y is a homomorphism. Obviously, kerg<«G. Also, a € ker» iff 
agH = gH for all g in G,i.e., g-tagE H.O 


Thus, ker y is a normal subgroup of G contained in H. One way of 
making sure ¢y is an injection is by choosing H not to contain nontrivial 
normal subgroups of G. We cannot improve our previous argument for Q, 
since every nontrivial subgroup contains (a?) <Q. However, the improved 
method is applicable for D, for each n > 2; we can take H = (b), since 
aba! = a?b F b. 


Example 5. Dy. Recall D, has generators a,b satisfying a” = b? = e and 
bab~* = a"~!; taking H = (b) (of index 3 =n in D,) we label the cosets 
H,aH,...,a°~'H as 1,2,....n, and construct the multiplication table: 
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So £4, & (1 2... n) and f © (2 n)(3 n—-1)(4 n - 2).... Obviously, 
o(fa) =n and o(f)) = 2, and 44.6, ' @ (Lnn—-1... 2), 80 flal,* = &51. 
We have displayed D,, as a subgroup of S',. 


Although Theorem 4 inspires us to search for nonnormal subgroups of 
G in order to find injections into S;, we can also apply the contrapositive 
and obtain normal subgroups of G. 


CoROLLARY 6. Suppose H < G with |G| = n and [G : H] = k, and let 
d =gcd(k!,n). Then H contains a normal subgroup N of G, with [G : N] 
dividing d. In particular, N is nontrivial if nf k! 


Proof. In Theorem 4 take N = kery < H, and let ¢ = n/|kergy|. By 
Lagrange’s theorem t = [G : N] = |y(G)| divides |S;,| = k!; but also t|n, so 
t divides d. 0 


COROLLARY 7. Suppose p is the smallest prime number dividing |G|. If 
G has a subgroup H of index p, then H «4G. 


Proof. As in Corollary 6, take d =gcd(p!, |G|) = p (since |G| has no prime 
factor dividing (p — 1)!). Thus H contains a normal subgroup N of G of 
index dividing p, so [H : N] is a proper divisor of [G : N] = p, implying 
IH: NJ=1,ie,H=N.0 

However, G need not have any subgroup of index p; for example, we 


have seen that A, has no subgroup of index 2. Here is a cute application 
(to be generalized in Chapter 10). 


COROLLARY 8. Suppose p is prime. Any group G of order p? is isomorphic 
to Zy2 or Ly X Zy, and thus is Abelian. 


Proof. If exp(G) = p?, then G is cyclic, so we may assume exp(G) = p. 
Take elements a,b # e, neither of which generates G, such that a € (b); 


then |(a)| = |(b)| = p, so [@ : (a)] = [G : (b)] = p*/p = p, implying 
(a), (b) 1G. Hence G & (a) x (b) by proposition 6.13. 0 


Example 9. Coupled with our results of Chapter 8, this result completes 
the classification of all groups of order < 16 (except 12), as follows (listing 
only nonprime orders, since any group of prime order is cyclic): 
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(Note Liz X Lig & Le: Lis X Ly & Zio; Ly X Lhe & L443 Zs X Ls & 
Z15.-) 


Group Representations 

Cayley’s theorem and its generalization motivate us to define all finite 
groups directly as sets of permutations. (In fact, this “concrete” description 
historically preceded the abstract theory.) In principle, then, we could use 
information about S, to tell us about arbitrary groups. Let us continue 
this process by injecting S,, into a well-known group. 


Example 10. There is an injection S,, > GL(n, F) (for any field F), which 
sends 7 to the “permutation matrix” having 1 in the 72,2 position for each 2, 
and 0 everywhere else. Thus, any group G of order n can be viewed as a 
subgroup of GL(n, F) via the composition G > S$, > GL(n, F); this is 
called the regular representation. 


In general, a group homomorphism y: G > GL(n, F) is called a group 
representation of degree n (over F), also cf. Exercise 10.19. For example, 
there is a representation Q >GL (2,C) given in Exercise 8.9. The advan- 
tage of studying G through group representations is that we can bring in 
the trace of a matrix as a powerful tool; 7.e., we obtain information about 
g in G by studying the trace of the matrix y(g). This powerful method, 
called character theory, lies outside the scope of this book. 

Another useful application of Example 10 is obtained by taking F = 
Z», since then GL(n, F) is a finite group that can be studied in terms of 
various known finite subgroups (such as the subgroups of diagonal matrices, 
of upper triangular matrices, and of upper triangular matrices having 1 on 
each diagonal entry, cf. Exercises 11.2, 11.3, 11.4. 


Exercises 


1. Given a monoid M, define the opposite monoid M°? to be the same 
set as M, but with multiplication in the opposite order. Show that 
M°? indeed is a monoid, and is a group if M is a group. 


. If Gis a group, then G°? (as well as G) is isomorphic to a subgroup 
of A(G). (Hint: Use the right multiplication map.) 

. If G is a group and a,b € G then each left multiplication map é, 
commutes with each right multiplication map 7, (in A(G)). 

. Show that there is an injection from (Z¢,+) to Ss. 

. De & S3 X Zo. (Hint: Take a® = e = b? with bab~t = a7!. Then 
(a?,b) = S3 and (a*) & Zo.) 

. Show how Exercise 8.6 can be proved “intuitively,” without knowing 
the permutations in advance. 

. If G is displayed as a subgroup of S,,, find a natural injection of 
G x G into $2, (by applying Exercise 6.10) and, more generally, of 
Gx.---x G (taken m times) into Sinn. 

. Show that for any field F there is a group representation of D4 given 
by aw (eA and bw Cece 

. Generalize Cayley’s theorem to monoids: For any monoid 5, left 
multiplication by any element sin S yields a map f,:. 5 > S, thereby 
injecting S into Map(S,S), the set of functions from S to itself. 


CHAPTER 10. RECOUNTING: CONJUGACY 
CLASSES AND THE CLASS FORMULA 


Recall that b is a conjugate to a in a group G if b = gag! for suitable g 
in G. Having seen already in Chapter 5 how conjugacy arises in determining 
when a subgroup is normal, we want to study conjugacy now in its own 


1 


right. Sometimes it is useful to note that g7tag = g~ta(g7!)7? is also 


conjugate to a. 


Remark 1. Conjugacy is an equivalence relation. (Indeed we check reflex- 
ivity, symmetry, and transitivity: g = ege—'; if b = gag”! then a = gq ‘bq; 
if b= gag_' and e= hbh—', then e = (hg)a(hg)'.) 


The equivalence class of a under this relation is called the conjugacy 
class of a. We usually designate a conjugacy class by one of its representa- 
tives. 


Example 2. The conjugacy classes in S;,,. By Remark 5.20, each conjugacy 
class of a permutation o is determined by its placement of parentheses, 
writing o as a product of disjoint cycles. The reader should try indepen- 
dently to list the conjugacy classes in S4 before reading on. The big worry 
is to make sure that one has not forgotten some conjugacy class. But the 
number of possible ways of choosing a cycle of length ¢ from m possible 
letters is n(n —1)...(n—t+1)/t; we divided by ¢ since the same cycle can 
be started at any of its entries, i.e. (1234) = (2341) = (3412) = (4123). 
Thus, the conjugacy classes of S'4 are 


Conjugacy class Number of 

elements 

{(1234), (1243), (1324), (1342), (1423), (1432)} 6 

{(123), (124), (132), (134), (142), (143), (234), (243)} 8 
{(12)(34), (13)(24), (14)(23)} 3 
{(12), (13), (14), (23), (24), (34)} 6 
{()} 1 
TOTAL 24 


The total number of elements in all conjugacy classes is 24 = |.S4|. It would 
be awkward to list all the elements of S; according to conjugacy class, so 
instead we make the following table: 
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Representative Number of elements in class 


(determined combinatorically) 


5! 
(12345) rake, 
5! 
(1234) zZ = 30 
54-32 
123)(4 ep 
(123)(45) a 
5-4-3 
(123) ae 
543-2 
(12)(34) (——)/2 =15 
OG 
(1) 1 
TOTAL 120 


(In calculating the size of the class of (12)(34) we had to divide again 
by 2, because (12)(34) = (34)(12).) 

In both of the examples above, the size of each conjugacy class divides 
the order of the group. Let us now consider the size of a conjugacy class in 
an arbitrary group. We call a conjugacy class trivial if it consists of exactly 
one element. 


The Center of a Group 


Definition 8. The center of a group G (denoted 7(G)) is {2 € G: gz = zg 
for all g in G}. 


For example, 7(G) = G iff G is Abelian. On the other hand, e € Z(G) 
for every group G. 
Remark 4. z € Z(G) iff gzg7! = z for all g in G, iff the conjugacy class 
of z is trivial. 


Partitioning G into its disjoint conjugacy classes C; U---UC;, we thus 
have 


IGl= D5 lel =|2(@)| + = ICi| (1) 
1<iX<t nontrivial conjugacy classes 
(1) is called the Class Formula, and, although obvious, can be made 
quite useful by means of a few observations. We need some method of 
computing the conjugacy class of a given element a in G. To this end we 
define the centralizer C(a) = {g € G: ag = ga}. 


PROPOSITION 5. The following facts hold for for anya€ G: 
(i) Cla) <G: 
(ii) a € Cla): 
(iii) Cla) =G iffa € Z(G). 


Proof. (i) Suppose ag = ga and ah = ha. Then a(gh) = gah = (ghja; 
furthermore 


gg ‘a=a=agg '=gag', 


implying g~!a = ag™+, ef., g71 € Cla). 
(ii) and (ii) are immediate. 0 
Thus, we see |C(a)| = |G| iff the conjugacy class of a is trivial, and we 


want to generalize this (to Proposition 7). 


LEMMA 6. There is a 1:1 correspondence ¢ between the conjugates of a 
and the right cosets of C(a), given by ¢(g tag) = C(a)g. 


Proof. Clearly ¢ is onto, so we want to check ¢ is well-defined and 1:1. But 
g 'ag=h'ah iff hg 'agh"! =a, iff gh"! € C(a), iff C(a)g = C(a)h. 0 
PROPOSITION 7. The number of conjugates of a is [G : C(a)] (and thus 
divides |G). 

Proof. Count each side in Lemma 6. 0 

Corotiary 8. |G] = |Z7(G)| + SO[G : C(a)], where the right-hand sum 


is taken over representatives of the nontrivial conjugacy classes (and, in 
particular, each [G : C(a)] > 1). 


COROLLARY 9. If G is a p-group, then Z(G) # {e}. 


Proof. Each term [G : C(a)] of the right-hand summation in Corollary 8 
is a power of p other than 1, and thus is a multiple of p. Hence p divides 


S“|G : C(a)] as well as |G, so Corollary 8 implies p divides |7(G)|. 0 


Z(G) plays a special role, largely because of the following property 
(also see Exercise 1). 


PROPOSITION 10. Z(G) <G. 


Proof. Let Z = Z(G). Z < G, as in the proof of Proposition 5(i) (or just 
by noting that Z = MaegC(a)). Moreover, Z <G, for if z € Z andg EG 
then gzg't =z€Z.0 

One application of the center is a strong converse of Lagrange’s theorem 
for p-suberoups, cf. Exercise 5; related results are given in Sylow’s Theorem 
(11.8(i)) and Frobenius’ Theorem (Exercise 11.12). Let us determine more 


information concerning how Z(G) sits inside G. 


LEMMA 11. G/Z(G) cannot be a nontrivial cyclic group. 


Proof. Otherwise, letting g denote the canonical image in G/Z(G) of any 
element g of G, we write G/Z(G) = (a), for suitable a in G. Let H = 
Z(G)(a), an Abelian subgroup of G. For any g in G we have 7 = a‘ 
in G/Z(G), for suitable i, implying g = a’z for suitable z in Z(G); hence 
g € H. Therefore G = H is Abelian, so Z(G) = G, contradiction. 0 


Corollary 12. [G: Z(G)] cannot be a prime number. 


Example 18. We apply these techniques to p-groups G, where p is prime. 
Let Z = Z(G). 

(i) Reproof of Corollary 9.8. Suppose |G] = p?. |Z| #4 1 by Corollary 
9, and |Z| # p by Corollary 12. Thus, |Z| = p?, so 7 = G. 

(ii) Suppose G is non-Abelian of order p?. Then |Z| = p, and G/Z = 
Zy X Zp, since G/Z is noncyclic of order p?. 


Groups Acting on Sets: A Recapitulation 
The following recurrent theme has pervaded several of the more complicated 
proofs: 


Definition 14. An action of a group G on a set Sis amap Gx S > S, 


sending (a,s) to a: s, satisfying the following properties: 


=s forall sin S; 


=a-(b-s) for alla,bin G, sin S. 
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Remark 15. Suppose the group G acts on a set S. Any a in G yields the 
left multiplication map 4:5 > S, given by f4(s) = a-s. Then aw é, 
defines a group homomorphism ¢:G — A(S). Indeed £, is the identity, by 
(i), and £71 = £,-1 by (ii). 

As usual, the ideas of group theory guide us along the correct path. 
For any s in S we let G, = {g€ G:g-s=s}, the stabilizer of s, and we 
call G-s the orbit of s. Then, as in Lemma 6, we have 


(2) IG-s| =1G:G.] 


Example 16. Let us see which group actions have occurred so far. 


(1) The usual group multiplication in G defines an action of G on itself, 
which was the basis for the proof of Cayley’s theorem (9.1). 

(2) Theorem 9.4 provides an action of G on the left cosets of a given 
subgroup H, given by a-bH = (ab)H. 


(3) G acts on itself by conjugation, cf. g-a@ = gag~'. Proposition 7 


then can be viewed as a consequence of (2). 


(4) The trivial action is given by g-s = s for every s in S, all g in G. 


Actually, Example 16(2) is quite general, cf. Exercise 14. Rather than 


recast our proofs in terms of group actions, we refer the interested reader 


to exercises 14 ff. 


Exercises 


1. 
2: 


10. 


p(Z4(G)) C 4(A), for any surjection y:G > H. 
Suppose G is generated by elements a1,...,a4. Then z € Z(G) iff 
za; = a;zfor1<i<t. 


. Using the usual presentations of Q@ and D,, in generators and rela- 


tions, show Z(Q) = (a7): and Z(D,,) = {e} for n odd, {e,a"/?} for 


n even. (Hint: a*b? commutes with b iff n|2z.) 


. Z(S,) = fe} for n > 2; Z(A,) = {e} for n > 3. What happens for 


n = 2,3? 


. If Gis a p-group and m divides |G], then G has a normal subgroup 


of order m. (Hint: Z(G) has an element z of order p. By induction 
on m, G/{z) has a normal subgroup H/(z) of order re 


.If G=S, anda=(12...n) ora =(12...n—1), then C(a) = (a). 


(Hint: Example 2.) 


. What are the conjugacy classes in D,? What are the conjugacy 


classes in A,,? 


. Determination of all non-Abelian groups of order p*, p prime. Let 


Z = Z(G). Then there are a,b in G\ Z such that a? € 7, bP € Z, 
and e # aba 'b-' € Z. Letting z = aba~'b~' show G is gener- 
ated by a,b and z. (Actually z is redundant, but is retained for 
convenience.) 


Case I. exp(G) = p. The relations a? = b? = z? = e, zaz~+ = a, 


~! = Bb, and ab = zba permit a presentation of the group, for 


zbz 
p > 2. (What goes wrong for p = 27) 

Case II. exp(G) = p?. Then one may assume o0(a) = p?, so 
(a) 4G and bab! = a* where 7? = 1 (mod p?); since y(p?) = p(p—1) 
this has a solution 7 4 1, by Cauchy’s theorem. 

Use Cayley’s theorem to display these groups concretely. 


. There are two nonisomorphic finite groups G and A such that, for 


each n, G and H have the same number of elements of order n. 
(Hint: Use the non-Abelian group of exponent p, found in Exer- 
cise 8.) Compare with Exercise 7.2. 

If Z(G) = {e} and |G| =n, then S,, contains a subgroup isomorphic 


Ti: 


12, 


13. 


14. 


15. 


16. 


to G x G°?, cf., exercise 9.1 (Hint: The canonical copies of G and 
G°P in S, intersect in Z7(G).) 


Double Cosets 

Given subgroups H, K of G and g € G define the double coset Hg kK 
to be {hgk :h € H,k € K}. Show that the double cosets (with 
respect to H, K) comprise a partition of G. 

Notation as in Exercise 11, show for ky, k2 in K that Hgk, = Hgk» 
iff ky ky" € Kg ‘Hg. Conclude that the number of ordinary 
cosets of H in HgK is [K: KO g7'Hogl. 

Prove the “double coset” version of the class formula, for any sub- 
groups H,K of G: 


IG: H] = S[k :K Ng "Hg. 
geEG 


Group Actions on Sets 
(In these exercises, we suppress the - in the notation of the 
group action.) An action of G on a set S' is said to be transitive 


if for every s1,82 in S there is g in G such that gs; = s2; more 
generally, the action is t-fold transitive if for any $1,..., $4,545...) 
in S' there is g in G such that gs] = s), ..., 95+ = $s}. 


We say actions of G on sets S and T are equivalent if there is a 
1:1 correspondence y:S — T which “preserves” the action, 1.e., 
p(gs) = gy(s) for all s in S. Any transitive group action of G 
on S' is equivalent to the one of Example 16(2), where H = G,, 
for any given sq in S. (Hint: Taking T = {left cosets of H}, define 
p:S > T by v(s) = gH, where g is taken such that s = gs.) 

A G-partition of S is a disjoint union S = S$, U---U S; of 
nonempty subsets stable under the given action, t.e., such that for 
each g in G, gS; = S; for suitable 7. Clearly, S has two “trivial” 
G-partitions: 

(i) #=1, and S; = S; 

(ii) t = |S], and writing S = {s1,...,5:}, take each S; = {s;}. 

We say the action is primitive if these are the only G-partitions. 
The next exercise shows that the definition depends only on 51: 
An action is not primitive iff S has a proper subset 5S’ of order > 1, 
such that, for each g in G, either gS’ = S' or gS' NS! = @. (Hint: 
{gS" : g € G} is a G-partition.) 

A transitive action is primitive iff G, is a maximal subgroup of G, 
for any s in S. (Hint: By Exercise 14, one may assume the action 
is as in Example 16(2); apply Exercise 15) 


abies 


18. 


19. 


Any two-fold transitive action is primitive. Conversely, if H < G 
is not contained in the kernel of the action and if the action is 
primitive, then the action of H on S is either transitive or trivial. 
If H < G acts transitively on S, then for any s in S, one has 
G = HG,. 
Suppose y: G >GL(n, F) is a group representation. Viewing M,,(F) 
as the set of linear transformations of an n-dimensional vector space 
V over F, one defines an action of G on V, by g- v = y(g)(v), for 
gin G and vin V. Verify the following properties: 

G) g:-(vu+w)=g-v4+gq-w for all gin G,v,win V; 

(ii) g- (av) = ag-v for all g in G,a in F, vin V. 

A vector space V over F is called a G-module if there is an 
action of G on V satisfying (i) and (ii). Thus, every represen- 
tation gives rise naturally to a G-module. On the other hand, 
given a G-module V that is n-dimensional over F, one can de- 
fine y:G SGL(n, F) by taking y(g) to be the linear transformation 
ur g-v. In this way conclude that there is a 1:1 correspondence 
between the group representations of G (over F’) and the G-modules 
that are finite dimensional as vector spaces over F. 


20. Two G-modules are isomorphic iff their G-actions are equivalent. 


CHAPTER 11. SYLOW SUBGROUPS: A NEW INVARIANT 


In the discussion of finite Abelian groups we saw that every Abelian group 
is isomorphic to the direct product of its Sylow subgroups. Of course this 
fails terribly for most non-Abelian groups, but amazingly enough, one can 
salvage a good part of this theorem. In the process we shall come up with 
a powerful new invariant — the number of p-Sylow subgroups. Our basic 
tool is a generalization of conjugacy classes. First we generalize centralizers 
of elements. 


Definition 1. Given any set A C G, define its nermalizer N(A)={g EG: 
gAg 1 = At}. 


Remark 2. N(A) < G, for if gAg7! = A and hAh7! = A, then A= h71Ah 
and, thus, gh! A(gh7!)~! = gh" Ahg7! = gAg 1} =A. 


Of course, this can be viewed in terms of group actions: G acts on 
{subgroups of A} by conjugation; the normalizer of A is just the stabilizer 
of A under this action. 

Note that N({a}) = C(a). Thus, Remark 2 includes Proposition 10.5(i). 
However, we shall be interested mainly when A < G, in which case AdN(A), 
thereby justifying the name “normalizer.” (In particular, if G is Abelian, 
then N(A) = G.) We want to generalize this definition a bit further. If 
HAT < G we define a conjugate of A with respect to H to be a set of the form 
h7 Ah for hin H. 


LEMMA 3. There is a 1:1 correspondence between the conjugates of A with 
respect to H, and the cosets of HM N(A) in H, given by 


ht Ah > (HM N(A))h. 


Proof. In parallel to the proof of Lemma 10.6 we note that the correspon- 
dence is onto, so we need merely note 


hytAhy =hy'Ahy iff hgh A(hehy')7! = A, 
iff hoh>'€ HON(A), iff (HN N(A))ho =(HON(A))y. O 


PROPOSITION 4. The number of conjugates of A with respect to H is 
[H: HN N(A)]. 

Proof. Just count in Lemma 3. O 

CoroLiary 5. The number of conjugates of A (in G) is [G: N(A)]. 
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Now let us try to take as much of the Sylow theory from Abelian groups as 
possible. 


PROPOSITION 6. Suppose K is a p-Sylow subgroup of G, and H 1s a p- 
subgroup of G. If H C N(K), then H CK. 


Proof. K4N(K),so HK < N(K), and |HK| = = is a power of p. 


Thus, HK is a p-subgroup containing K,so HK = K,implying H C K.O 

Note that when G is Abelian H C N(K) = G, so we have a general- 
ization (albeit modest) of Theorem 7.11(ii), which suffices for the following 
key computation. 


COROLLARY 7. Suppose K is a p-Sylow subgroup of G, and H is any 
p-subgroup of G. Let m be the number of conjugates of K with respect 
to H. Then m is a power of p, which is 1 iff H C K. 


Proof. m = |H : HM N(K)], a power of p, and is 1 iff H C N(K), iff 
H CK by Proposition 6. 0 


We are ready for possibly the most important basic theorem in group 
theory. 


THEOREM 8. (Sylow’s Theorems.) Suppose p is a prime dividing |G|. Write 
my, for the number of p-Sylow subgroups of G. 


(i) If pt divides |G|, then G has a subgroup of order p*; in particular, 
G has a p-Sylow subgroup, so m, > 1. 

Every p-subgroup is contained in a p-Sylow subgroup. 

Any two p-Sylow subgroups of G are conjugate. 

My, =1 (mod p). 

My, divides I. where p' is the largest power of p that divides |G|. 
More precisely, m, is the index of the normalizer of any p-Sylow 
subgroup (in G). 


Proof. The proof is comprised of two main parts. First we use the Class 
Formula to prove (i). Then, knowing that a p-Sylow subgroup exists, we use 
it to set up a machine that grinds out the rest of the theorem by means of 
a clever counting argument based on Corollary 7. We shall use repeatedly 
the following easy observation : If u = 4 and p* | n but p*{m, then p | u. 
Let us turn to the proof itself. 

(i) By induction on |G|: if pt divides |H| for some H < G, then by 
induction H has a subgroup of order pt, which is then a p-subgroup of G. 
Thus, we may assume that pt does not divide |H]|, for every H < G. In 


particular, p' { |C(a)| for every a € G \ Z(G); hence p|[G : C(a)] whenever 


a ¢ Z(G). But p divides |G|, so Corollary 10.8 implies p divides |7(G)]. 
Cauchy’s theorem implies Z(G) contains an element z of order p. 

Clearly, (z) < G, and |G/(z)| = — By induction, G/({z) contains a 
subgroup of order pt~!, whose preimage in G is then a subgroup of order 
p-p'-1 =p', yielding (i) as desired. 

Now that we have proved G has a p-Sylow subgroup, which we denote 
as K, we continue with the following set-up: Let C = {conjugates of K}, 
which clearly are p-Sylow subgroups of G. Let m = |C|, the number of 
subgroups of G conjugate to K. Hence m = [G: N(K)], by Corollary 5. 
But p{[G: K], and |K]| divides |N(K)|, by Lagrange, so p fm. 

Now take any p-subgroup H of G. Conjugation with respect to H 
defines an equivalence relation on C, thereby partitioning C into C,U---UC;, 
for some k. Then 


IC] = |G] + +--+ [Cel 


so pf |C;| for some j. 

For each z let u; = |C;|, and pick K; arbitrarily in C;. By Corollary 
7, each u; is a power of p, which is 1 iff H C K;. But p{ uj; so u; = 1, 
implying H C K;. This proves (ii). 

Next, we take H to be a p-Sylow subgroup, and continue from where 
we just left off. |H| = |.A;| implies H = Kj, proving H is conjugate to K, 
yielding (iii). In particular m, = m. On the other hand, H ¢ K; for all 
i # 7 (since otherwise K; = H = K;, which is false). Thus, for each i # j 
we see u; # 1, implying plu;. Hence 


My = Ut... + Uj tUy + Ujzit... + UR 
=O +...4+0 +1+0 +---+0 =1 (mod p), 


proving (iv). 
Finally, m, = m = [G : N(K)], which we saw divides [G: K] = Bee 
yielding (v). 0 


Theorem 8(iv) should remind us of an analogous result obtained earlier, 
in Exercise 3.2, and lead us to wonder whether there is a general theorem 
concerning the number of subgroups of order p*. In fact, such a result (due 
to Frobenius) exists, ef. Exercise 12, which has a very slick proof using 
group actions. 


Remark 9. One point of Theorem 8 is that m, = 1 iff the Sylow p-group is 
normal in G, in view of Proposition 5.7. This gives us a powerful tool for 
obtaining nontrivial normal subgroups. 


Groups of Order Less Than 60 
What can we say so far about classifying finite groups? Let p,q denote 


distinct prime numbers. 


iB 
2: 


If |G| =p then G & Z, is cyclic, by Theorem 6.2. 
If G is Abelian, then G is a direct product of cyclic groups, by 
Theorem 7.2. 


. If |G| = p?, then Gis Abelian, so G & Z* or Z,x Z, by Corollary 9.8. 
. If |G| = pq for p > q, then either G = Z, x Zq * Zp, or G is the 


semidirect product of Z, by Z,; this latter group only exists if q 
divides p— 1, and then is non-Abelian and uniquely determined ( cf. 
Example 8.8). 


Assume in the remainder that |G| is not prime. We shall prove that 


G has a nontrivial normal subgroup if |G| < 60. By what we have just seen, 


this is clear if Gis Abelian or has order pq, so we exclude these cases from 


the subsequent discussion. 


5. 


6. 


10. 
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If |G| = p* for t > 3, then the normal subgroup Z(G) has possible 
order p,p”,....p" 7. (This covers the cases |G| = 8,16, 27,32.) 

If |G| = p'v for p > v > 1, then m, = 1, by Theorem 8(iv),(v), 
so the p-Sylow subgroup is normal, by Remark 9. (This covers the 
cases |G] = 18, 20, 28, 42,44, 50,52,54.) Note that » was taken not 
prime when ¢ = 1, in view of (4) above. 


. If |G| = 40 or 45, then ms = 1, so the 5-Sylow subgroup is normal, 


by Remark 9. (Also cf. Exercise 6.) 


. If |G| = 12, 24, or 48, then any 2-Sylow subgroup of G either is nor- 


mal or contains a nontrivial normal subgroup of G, by Corollary 9.6. 
(Also cf. Exercise 8.) 


. If |G] = 36, then any 3-Sylow subgroup of G cither is normal or 


contains a nontrivial normal subgroup of G, by Corollary 9.6. 

This leaves us with 30 and 56. 
If |G| = 56, then m; = 1 or 8. If there are eight subgroups 
Ay,...,Hs of order 7, then noting |H; 9 H;| = 1 for each i # 7 
(since 7 is prime) we see each H; has six elements of order 7, and 
these are all distinct, so G has 6-8 = 48 elements of order 7; 
hence only eight elements have order #7, and these are needed for 
a unique subgroup of order 8 = 23. We have proved m7 = 1 or 
mz = 1,80 G has a nontrivial normal subgroup. 
If |G| = 30 then m3 = 1 or ms = 1, as argued above. (Otherwise, 
ms = 10 and m; = 6, so G would require twenty elements of order 
3 and twenty-four elements of order 5.) We can push this much 
further, cf. Exercise 7. 


In summary, we have proved the following useful fact: 


PROPOSITION 10. Every group G of nonprime order < 60 has a nontrivial 
normal subgroup. 


Finite Simple Groups 
There are groups failing to have any nontrivial normal subgroups; such 
groups are called simple. Obviously the only simple Abelian groups are the 
cyclic groups of prime order. The easiest non-Abelian example of a simple 
group is As, which has order 60 (thereby showing that Proposition 10 is 
sharp); in Exercise 5.6 we saw that A,, is simple for all n > 5. 

Two famous theorems that can be formulated about finite simple groups 
are: 


BURNSIDE’S THEOREM. If G is nonabelian, finite simple then |G| has at 
least three distinct prime factors. 


FEIT-THOMPSON THEOREM. Every nonabelian finite simple group has 
even order. 


(The original proof of the Feit-Thompson Theorem required 250 pages in 
a research journal!) Actually these two theorems are usually formulated in 
terms of “solvable” groups, as will be explained in Chapter 12. 

Recently the complete classification of simple groups has been finished, 
and consists of 18 infinite families (one of which is the family of cyclic 
groups of prime order, and another of which is {A, :n > 5} ), as well as 
26 “sporadic” groups, some of which crop up for rather mysterious reasons. 
This theorem is a major triumph for mathematics in the second half of 
the twentieth century, having required the combined efforts of dozens of 
research mathematicians aided by computers; however, it also raises ques- 
tions as to what a “proof” is, since a complete proof of this result has not 
yet been written down from start to finish. 

In the exercises of Chapter 3 we were introduced to the classical groups, 
which are certain subgroups of GL(n, F) and their homomorphic images. 
These provide several important classes of simple groups, which are finite 
when F is a finite field. Some of this theory will be outlined in the ex- 
ercises of Chapter 12; details may be found in Chapter 6 of Jacobson’s 
Basic Algebra I (1985), Freeman, San Francisco. The reader interested in 
The Classification could pursue the matter in Gorenstein’s tomes on group 
theory. 


Exercises 


1. Show how Lemma 3 is really a special case of Remark 10.15. 


. Let H be the subgroup of G =GL(n, F) consisting of all “upper 
unipotent” matrices, 7.e., upper triangular matrices whose diagonal 
entries are all 1. If |F| = q is a power of p, show H is a p-Sylow 
subgroup of G. (Hint: |H| = q™"—)/?; compare with Exercise 1.6.) 
_ If H is a p-Sylow subgroup of G and K < G, then KN gHq™? is 
a Sylow p-subgroup of K, for suitable g in G. (Hint: By the class 
formula for double cosets, ef. Exercise 10.13, some [K : KNgHg™] 
must be prime to p.) 

. Reprove Theorem 8(i) by applying Exercise 3 to Exercise 2, noting 
that any finite group of order n can be embedded into GL(n, Z,). 

. If Sis a p-Sylow subgroup of G, then N(N(S)) = N(S). (Hint: S is 
the only p-Sylow subgroup of N(S), implying gSg7! = S for all g 
in N(N(S)).) 


Classification of Groups of Various Orders 

. Every group of order 45 is isomorphic to Z3 x Z3 x Zs or Zo X Zs. 
(Hint: m3 = ms = 1.) 

. The groups of order 30 (up to isomorphism) are Z3q, Dis, Z3 x Ds, 
and Z; x D3. (Hint: Let |H| = 3 and |K| = 5. Then H or K is 
normal in G, implying N = HK < G. But |N| =15 so N & Zs. 
Write N = (a)<G. Take b of order 2. bab~! = a’ where 7? = 1 
(mod 15); there are four solutions.) 

. Every group of order 12 is isomorphic to one of the following five 
groups: Z4 Z3, ZX Z.x Z3, Dg & S3 Xx Zo, Ag, and a fifth group 
to be described in the hint. (Hint: Case I. mz = 1 and m3 = 1. 
Then G is Abelian. 

Case II. mz = 1 and m3 4 1. Let N be the 2-Sylow subgroup 
of G. Suppose o(a) = 3. If N were cyclic, then take o(b) = 4 and 
note aba~? = b~+ contrary to o(a) = 3. Thus, N = fe, by, b2,b3} 
with each 0(b;) = 2. One may assume abja7! = b;41 (subscripts 
mod 3), so G is uniquely determined, and thus G = Ag. 

Case III. my. #4 1. Then m3 = 1, by counting the number of 
elements in G. By Corollary 9.6 any Sylow 2-subegroup contains 
a normal subgroup (z) of G of order 2, so z € Z(G). Suppose 
o(a) = 3, and let ¢ = az, which has order 6. Take 6 € G with 
beb—! # ec. Replacing b by a suitable odd power of b, one may assume 
o(b) is a power of 2. If o(b) = 2 then G & Dg; if o(b) = 4, then 
6? = c? = z, and one has an analog of Q.) 

. Determine every group G of order 18. (Hint: Let N be the Sylow 
3-subgroup of G. If N is cyclic, then G & Zig or Dg. If N is 
noncyclic, then N x Z3 x Z3, which has generators a1, a, of order 3. 


10. 


1: 


12: 


Take 6 of order 2. 

Case I. b commutes with both of the a;. Then G & Zs. 

Case II. b commutes with one of the a;, but not the other. 
Then G & D3 x Zo. 

Case III. bayb = ag. Then bayazb = ayaz, so this case 
degenerates to case II. 

Case IV. ba,b = 7 ie and bagb = G5 Describe this group 
as a subgroup of S¢. 
Classify all groups of order < 32, except for those of order 16. (Hint: 
The previous results cover all orders except 24 and 28. |G| = 28 is 
rather like Case ITI of Exercise 8 (since m7 = 1); 24 is trickier, since 
there are five possibilites for the 2-Sylow subgroup.) 
Q cannot be isomorphic to a subgroup of S, for n <5. (Hint: D4 is 
a 2-Sylow subgroup of Ss.) 
(Frobenius’ theorem) If p* divides n = |G, then the number m of 
subgroups of G having order p* is congruent to 1 (mod p). (Hint: 
Let |G| = p*q. Let S = {subsets of G of order p*}, and let G act on 
S by left translation. If S € S, then,writing S as a disjoint union 
of Gs-orbits, note that |Gs| divides || = p*. 

Let So ={S ES: |Gs| =p*}. If S € S\ So, then pq divides 
|G - S|; so |S] = |So| (mod pq). 

On the other hand, if S € So, then for any s in S, note that 
Ggs is a subset of S$ of order p*, so S = Ggs is a coset of Gs: 
conversely, for any coset Hg of any subgroup H having order p*, 
show that Hg € So. Thus, |So| = m= = mq. 

Conclude m = = (mod p). But |S| depends only on n and p*, 
t.e., is independent of the particular group structure of G. Thus, it 
suffices to prove the result for any group G of order n; conclude by 
taking G cyclic.) 


CHAPTER 12. SOLVABLE GROUPS: WHAT COULD BE SIMPLER? 


Certain groups are considerably more general than Abelian groups, but 
possess some of their nice properties. We shall discuss these groups now, 
in preparation for an application in Chapter 26. The underyling question 
is, “How far is a given group from being Abelian?” The answer lies in 
the rather trivial fact that ab = ba iff aba~'b~! = e, which we needed in 
Proposition 6.13. This leads us to view the subset of elements of the form 
aba~+b~* as the obstruction to a group being Abelian. 


Commutators 


Definition 1. The group commutator [a,b] is defined as aba~1b7!. The 
commutator subgroup G' is the subgroup of G generated by all group com- 
mutators in G. 


Remark 2. (i) If H <G then H' < G’. 
(ii) [a, b]~' = bab-ta~! = [b, a]. Thus, every element of G’ is a product 
of group commutators. (See, however, Exercise 19.) 


PROPOSITION 3. (i) f(G’) C A’, for any group homomorphism f:G > H. 
(ii) G'<G. 


Proof. (i) f({a1. 61] -.-[@n,n]) = [f(a1). f(b1)] ---[F(an), f(On)] € A! 
(ii) Taking fy:G — G to be the homomorphism given by a> gag™', 
we see by (i) that gG’g7! C G’, for each g in G. Hence G'«G. 0 


This thread is carried further in the addendum, but meanwhile let us 
get to the main point of commutators. 


PROPOSITION 4. (i) G/G' is an Abelian group. 
(ii) Conversely, if N<G and G/N is an Abelian group, then G' C N. 


Proof. (i) For any a, in G we sce [a,b] € G’ by definition, so the images 
of a and b commute in G/G’. 
(ii) For any a,b in G we have [a,b] € N, so GCN. O 


In particular, G is Abelian iff G' = {e}. On the other hand, here is an 
example with G' = G. 


Example 5. (i) If Gis simple non-Abelian, then G’ = G (since G' # {e}). 
In particular, Ai, = A,, for all n > 5. 

(ii) S! = A, for all n > 5. Indeed, S,/A, is Abelian, implying S!, C 
Ay, but S! D Al = Ay. 
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Solvable Groups 


Definition 6. A subnormal series from G to a subgroup N is a chain of 
subgroups, 


(1) G=G)>G,>G2>°::>G=N, 


where each G;41 <G;. We call the G;/Gi41, 1 <i < t, the factors of the 
series. G is solvable if there is some subnormal series from G to {e}, with 


each factor G;/Gi41 Abelian. 


In particular, every Abelian group is solvable. The key to solvability 
lies in the derived subgroups G™ of G, defined by GO = G and inductively 
GD = Ga’, 


PROPOSITION 7. G is solvable iff some G = {e}. 


Proof. (€) Take G; = G™. Then G;/Gis1 = Gi/G', is Abelian, by Propo- 
sition 4(i). 

(=>) Given a subnormal series G = Go > Gi > G2 > +--+: > G; = fe}, 
we shall prove by induction on i that G < G; for 1 <i < t. (Then 
GO < G, = fel). Indeed, G = G = Go. Now assume GO-) < Gj_4. 
Then G@ = GG-0! < Gi_1' < Gj, since G;_1/G; is Abelian. 0 
Remark 8. Assume N<«G. We can understand subnormal series better by 
recalling that H — H/N defines a 1:1, onto, order-preserving correspon- 
dence between {subgroups of G containing N} and {subgroups of G/N}; 


furthermore, by Noether II (Theorem 5.17), K/H x (K/N)/(H/N) if 
N<H<K with «4K. Thus, any subnormal series from G to N, 


G=G)>G,>G,>-:->G,=N, 
corresponds to the following subnormal series from G/N to N/N = {e}: 
G/N =Go/N > Gi/N > G2/N >--- > Gi/N = {e}, 
where the corresponding factors G;/Gi41 and (G;/N)/(Gi41/N) are iso- 


morphic. 


THEOREM 9. Suppose N<G. 
(i) If G is solvable, then N and G/N are solvable. 
(ii) If N and G/N are solvable, then G is solvable. 


Proof. (i) We use Proposition 7. By induction, N“ < G™ for each i; if 
G = {e}, then N = {e}, proving N is solvable. Similarly (G/N) 


is the image of G in G/N (seen by taking cosets at every stage), so if 
G® = fe}, then (G/N) = {e} . 

(ii) We are given a subnormal series for G/N having Abelian factors, 
which we translate to a subnormal series from G to N, and then continue 
with a given subnormal series from N to {e} having Abelian factors, thereby 
obtaining a subnormal series from G to {e} having Abelian factors, implying 


G is solvable. O 
COROLLARY 10. Every p-group is solvable. 


Proof. By induction on |G|. Z(G) 4 {e} and is Abelian and, thus, solvable. 
G/Z(G) also is a p-group, so by induction is solvable; hence G is solvable. 0 


(See Exercise 4 for an improved version of this result.) 
COROLLARY 11. Every group of order < 60 is solvable. 


Proof. We may assume G is non-Abelian. But by Proposition 11.10, G has 
a nontrivial normal subgroup N. N and G/N have smaller order than |G], 
so by induction are solvable; hence G is solvable. O 


A similar induction argument enables one to extend the two deep the- 
orems quoted for simple groups in Chapter 11, to the formulation in which 
they are usually remembered: 


BURNSIDE’S THEOREM. If |G| = p'q’ for p,q prime, then G is solvable. 


FEIT-THOMPSON THEOREM. Every group of odd order is solvable. 


Let us close with one final application of Remark 8, which will be 
needed in Chapter 27. 


PROPOSITION 12. A finite group G is solvable iff it has a subnormal series 
each of whose factors is cyclic of prime degree. 


Proof. We start with a subnormal series 
G—= Go > Gi > Go >-:+->G: = fe}, 


having Abelian factors. We want to insert enough entries between each 
G; and Gj41 for the factors to be cyclic of prime degree; 7.e., we want a 
subnormal series from G; to Giz1, having cyclic factors of prime degree. 
In view of Remark 8, it is equivalent to find a subnormal series from the 
Abelian group A = G;/Gj41 to {e} having cyclic factors of prime degree. 
But this is done easily by induction on n = |A]. If nis prime, then A already 
is cyclic; if n is not prime, take a prime factor p of n and an element a in A 
of order p, and apply induction to A/(a), to obtain the desired subnormal 
series from A to (a). 0 


Addendum: Automorphisms of Groups 

An automorphism of a group G is an isomorphism from G to G. Group 
automorphisms play an important role in the theory of groups, and so we 
would like to discuss them briefly here. We have already encountered one 
type of automorphism — for any a in G one has the inner automorphism 
given by g > aga~', which we denote here as Tq. 

The set of automorphisms of G is easily seen to form a group called 
Aut(G), whose group operation is the composition of maps; the inner 
automorphisms, denoted Inn Aut(G), comprise a normal subgroup (for if 
o € Aut(G) and 7, € Inn Aut(G), then 07,07! = To(q)). Of course, if G is 
Abelian, then Inn Aut(G) = (1). 


Example 13. Suppose G = (g) is cyclic of order n. Then any homomorphism 
o:G — G satisfies o(g) = g™ for suitable m; conversely, given m we can 
define a homomorphism o:G > G given by gi »& g'™ for each i. Then 
o is an automorphism iff o(g”) = o(g) =n, which is true iff m and n are 
relatively prime. It follows easily that Aut(G) = Euler(7). 


Definition 14. A subgroup A of G is characteristic, written H char G, if 
o(H) C H for every o in Aut(G). 


Noting that H 4G iff o(H) C H for every o in Inn Aut(G), we see 
that every characteristic subgroup is normal, and we draw a few paral- 
lels between characteristic subgroups and normal subgroups, omitting the 
straightforward proofs. 


Remark 15. (i) H char G iff o(H) = H for every ¢ in Aut(G). 
(ii) If H is the unique subgroup of G having order m, then H char G. 
In particular, every subgroup of a cyclic group is characteristic. 


On the other hand, Exercise 23 shows a significant way in which char- 
acteristic subgroups behave “better” than normal subgroups. 


Exercises 


1. Show A, = {(1), (12)(34), (13)(24), (14)(23)}. (Hint: This is the 
only nontrivial normal subgroup of A4.) 

2. Di, = (a*), under the notation of definition 8.5. (Hint: (a?) is 
a normal subgroup, and the residue group has order 2 or 4.) Of 
course, (a?) = (a) iff n is odd. 


Nilpotent Groups 

3. Define G! = G’, and G’*! to be the subgroup of G generated by 
{[a,g]: a € G’, g € Gh. Gis called nilpotent of class t if some 
Gt = {e} (t minimal such). Show that G™ C Gt; hence every 


10. 


1 


nilpotent group is solvable. The center of every nilpotent group is 
nontrivial. Thus A4 is an example of a solvable group that is not 
nilpotent, since its center is trivial. 


. The central series is defined via 7, = Z(G), and inductively 7;41(G) 


is that subgroup H D Z;(G) of G for which H/27;(G) = 7(G/Z;(G)) 
Show by induction on ¢ that G is nilpotent of class t, iff 7,(G) = G 
Conclude that every p-group is nilpotent. 


. If Gis a nilpotent group, then N(H) > H for every H < G. (Hint: 


If 7;(G) < H, then 7;41(G) < N(H).) Conclude from Exercise 11.5 
that G has a unique Sylow p-subgroup, for each prime p dividing |G]. 


. A finite group is nilpotent iff it is a direct product of p-groups. 


(Hint: Exercise 5.) 


. Let Z = Z(G). Gis nilpotent of class 2 iff G’ C Z. Conclude that 


the following identities hold for all a,6,¢ in G: 
[ab, c] = [a, c][b, e]; [a, be] = [a, b][a, e]; 
(ab)" = a" b"[a, br —VP?, 
(Hint: be = cbz for suitable z in Z.) 


. If G is nilpotent of class 2 and is generated by ay1,...,a,, then 


every element of G has the form at ...a'nz where z € G’ C Z, 


5) commutators [a;,a;]. 


and consequently G’ is generated by the ( 
for l<i<g<n. 


The Special Linear Group SL(n, F), for a Field F 


. SL(n, F)<GL(n, F), and GL(n, F)/SL(n, F) & F \ {0}. (Hint: By 


Exercise 4.3.) Using Exercise 1.6, compute |SL(n, F)| when F is 
finite. 

For i # j, define the elementary matrices T,;(a) = I + ae;; in 
SL(n, F). (7 denotes the identity matrix.) Tij(a)” = Ti;(na@), for 
all n in Z. In particular, T;;(a@)~! = Tij;(-a); if F = Z»y, then 
Tij(a@)? = I. Conclude that [Tj;(@).Tj.(8)] = Tix(aB) if i A k: 
[T:3(@), Tre(B)] =Jifi = £ and j - k. 

Multiplying a matrix A on the left (resp. right) by Tj;(@) takes A 
and adds on a times the jth row (resp. 2th column) of A to the ith 
row (resp. jth column). Prove SL(n, F) is generated by elementary 
matrices. (Hint: There are elementary row and column transforma- 
tions taking A € SL(n, F) to a diagonal matrix: equivalently there 
are suitable products P and Q of elementary matrices such that PAQ 


is diagonal, so assume A is diagonal. But for 0 4 c,d € F there are 


cd 0 


elementary transformations taking jc : to ( : 4 (in five steps); 


12. 


13. 
14. 


15. 


16. 


17. 


writing d,,...,d, for the diagonal entries of A, one can thus trans- 
form diag{d,,...,d,} to diag{d,,...,dn—1dn,1}, and continue by 
induction to arrive at the identity matrix.) 

Suppose n > 2. Then SL(n, F)!’ = SL(n, F) unless n = 2 with 
|F| < 3. (Hint: By Exercise 11 it suffices to display Tj;(a) as 
a commutator. Ty2(a) = [Ti3(a),7T32(1)], if n > 3. Ifn = 2, 
then assuming |F| > 3, take y # 0 with y? 4 1, and solve for 


Tr(a) =((7) 2) .T2(A))) 
GL(n, F)' = SL(n, F), under the hypotheses of Exercise 12. 


Z(SL(n, F)) = {al : a" =1 in F}. 


The Projective Special Linear Group PSL(n, F) 

Define PSL(n, F) = SL(n, F)/Z(SL(n, F)). PSL(n, F) is trivial 
when n = 1, so we assume n > 1. If |F| = q is finite, |PSL(n, F)| = 
(q” —1)(q” —q)...(q? — q?~*)q”“1/d, where d = gcd(n,q—1). In 
particular, PSL(2, F) has order (q? — 1)q for q even, and ive for 
q odd. (Hint: Exercises 9 and 14.) 

Let F(™ denote the standard n-dimensional vector space over the 
field F, and let S = {1-dimensional subspaces of F‘")}. SL(n, F) 
acts on S' by the rule A(Fv) = Fw where w = Av (for A € SL(n, F), 
04v€ F™); show that (AB)(Fv) = A(B(Fv)) and 


Z(SL(n, F)) = {A € SL(n, F): A(Fv) = Fv for allu 40€ F™}, 


(Hint: If v,w are F-independent and Av = av, Aw = Bw, and 
A(ut+w) = y(v+w) with Fv 4 Fw, then (a — y)v = (y — B)w so 
a=7= fp.) 

PSL(n, F) is a simple group, unless n = 2 with |F| < 3. (Hint: 
Let G = SL(n, F). Suppose Z(G) C N«G. Show N = G, in the 
following series of steps, notation as in Exercise 16. 

1. If Fu, 4 Fv, and Fw, # Fw,then there is Ain G such that 
A(Fv;) = Fw, for i = 1,2. (This is because any two independent 
rows can be extended to an invertible matrix.) 

2. For any v,w in F there is B € N for which B(Fv) = Fw. 
(Hint: Since Z(G) CN, there is « in F(/™ and Bo € N for 
which Bo(Fa) #4 Fx. Take A in G such that A( Fz) = Fv and 
A(Bo(Fz)) = Fw. Then AB,A7! € N, and AB) A71(Fv) = 
AB) (Fx) = Fw.) 

3. Let vy = (1,0,...,0)€ F(™ and H={AEG: Av, € Fuy}. 
Then G = NH. (Hint: Given any A in G take B in N such that 
AFv, = BFv,; then B-1A € FH. 


18. 


19. 


20. 
21. 


22. 


23. 


24, 


25. 


26. 


4. Identify H with the set of matrices having (2, 1)-position 
0 for every i > 1. Let K be the subgroup of G generated by all 
elementary matrices of the form T,;(a), a € F. Then K «4H, so 
NK«NH =G. But every T;;(a) is conjugate to T,;(a), and thus 
is in NK, implying NK = G; thus, G/N = K/(NO K). 

Since K is Abelian, conclude that N > G' = G.) 
Using Exercises 10.17 and 10.18, show that the proof of Exercise 17 
can be applied to obtain the following more general result: Suppose 
we are given a group G = G’ and a primitive action of G on a 
set S. Suppose, moreover, for some s in S that G, contains some 
normal Abelian subgroup A of G, for which {UgAg ' : g € G} 
generates G. Then G/K is a simple group, where K is the set of 
elements of G acting trivially on S. (There are several important 
instances where this holds, in groups arising from bilinear forms.) 
If n = [G : 2(G)], then the number of distinct commutators in 
G is at most n(n —1). (Hint: [z1a, 226] = [a, 6].) Conclude that 
some element of G’ is not a commutator, if |G’| > n(n — 1). To 
construct such an example, find a subgroup G of SL(m,Z,) (for 
suitable m > n), generated by suitable matrices A,,...,A,, such 
that G is nilpotent of class 2, |G/Z(G)| = p”, and |G’| = p™"—-D/? 
(Hint: Refer to exercises 7 and 8. In the notation of Exercise 10 
write E;; for T;;(1). For n = 3 take Ay = Ey., Ao = E34 Eo7, and 
A3 = F’se Fog E49. Generalize to arbitrary n.) 


Exercises for the Addendum 

Inn Aut(G) = G/Z(G). 

G = Z» x Z, has subgroups that are not characteristic (but are 
normal since G is Abelian). 

Prove that if Aut G = {1}, then |G| < 2. (Hint: Inn Aut G = {1} 
implies G is Abelian, and thus has the automorphism g 1 g7!; 
hence exp(G) < 2.) 

If H char K and K char G, then A char G. If H char K and K4G, 
then H «4G. However, H<« K and K 4G do not necessarily imply 
AAG. 

The derived subgroups G™ are characteristic. (Hint: Apply induc- 
tion to Exercise 23.) 

A Sylow p-subgroup of G is characteristic iff it is normal. 


Semidirect Products, Also Cf. Example 8.8 
Suppose H,G are groups, with a homomorphism H — Aut(G) 
given by 0 +} @. Define a binary operation on the cartesian product 


27. 


28. 


29. 


30. 


3l. 


32. 


K = GxH by 


(91,7)(92,7) = (g1F(92),07). 


Show this defines a group structure on K, called the semidirect 
product of G by H. Note that K has subgroups G = G x {e} and 
H = {e} x H satisfying the following properties: 

K=GH=HG: G«k; GnH = {el}. 
Conversely to Exercise 26, suppose K is any group having subgroups 
G and Hi satisfying K = GH, G«K, and GN A = {e}. Then K is 
isomorphic to the semidirect product of G by H. (In this situation 
we say K is the internal semidirect product of G by H.) 
Suppose a group K has a subgroup G of index p, where p is the 
smallest prime number dividing |K|, and K \ G has an element h 
of order p. Then K is the internal semidirect product of G by (h). 
For p an odd prime, display the non-Abelian group of order p*® and 
exponent p as a semidirect product of Z, X Zy by Zp. 


The Wreath Product 
Supppose A,G are groups. The wreath product W of G by A is 
defined as follows: Let B = {functions from A to G}, viewed as a 
eroup by means of “pointwise multiplication;” 7.e., if we write (ga) 
for the element of B given by a +> gq, for each a in A; then for 
each (ga).(ha) in B we define (ga)(ha) = (Gaha) € B. G acts as a 
group of automorphisms of B, by “translation,” 7.e., each h in G 
induces the homomorphism o;, of B given by on((ga)) = ((hg)a); 
W is defined as the semidirect product of B by G. Show this is a 
group and compute its order. The wreath product is an important 
tool for constructing weird groups, but in its easiest instance (see 
the next exercise) produces quite manageable examples. 
Suppose |G| = n. The wreath product of G by Z,, can be defined 
explicitly as follows: Viewing G as a subgroup of S',, let W be the 
subgroup of Sm» generated by Gx---x G (viewed naturally in Sn, 
cf. Exercise 9.7) and the cycle 
(Lnt1... (m=-1)n4+1)(2n+2 ... (m—-1)n+2)...(n 2n ... mn). 
Show that W is a group of order mn™. What group do we get 
for m= n= 2? 
In general, one can describe the wreath product of G by A as fol- 
lows, where m = |A|: By means of Cayley’s theorem, view A as 
a subgroup of S,, and thus as a group of permutation matrices, 
which acts by conjugation (via the usual matrix multiplication) on 


the set of “diagonal matrices with entries in G.” Of course, this 
description is rather imprecise, since in general one cannot define 
multiplication of matrices whose entries are in groups, but there is 
no difficulty in this instance because the matrix multiplication does 
not involve addition. 


Review Exercises for Part I 


1) True or false? (Prove or give a counterexample) The set of left invertible elements 
in a monoid forms a group. 


2) 
(i) What are the last two digits of p29 


(ii) What are the last two digits of 3” 2 

(ili) What are the last two digits of 6” 2 

3) Write down all the subgroups of Euler(n) for n=5,6,7,8,12. 

4) True or false? A group can be the union of two proper subgroups. 

5)Give an example of a group of even order, that does not have a subgroup of index 2. 


6) Give an example of a function f:M>N of monoids, such that f(ab)=f(a)f(b) for all a,b in M 
but such that f(e) # e Give an example where M,N are finite. 


7) Prove from scratch: In any group of even order, the number of elements of order 2 must be 
8) Compute the exponent of the following groups: D,,S, and A, for n=4,5,6,7,8 


9) True or false? (Prove or give counterexample) The product of two subgroups of Ds must be 
a subgroup. 


10) Prove that if |G|=2n for n odd then any two elements of order 2 in G cannot commute. 


11) The direct product of two cyclic groups of respective orders m and n is cyclic, iff m and n 
are relatively prime. 


12) Prove (G, X G,)/(N,;X N,) = (G,/N,) X (G)/N,) for N; < G, and N, < G>. 


13) Give an example of two nonisomorphic, nonabelian groups having the same order and 
the same exponent. 


14) True or false: If a=(2,1,4,4) in A= Z4 X Zg X Zg X Zs then <a> is a direct summand of A. 


1m 
15) Prove in SL(n, Z,) that the subset A={ 04 } with m in Z, is a cyclic subgroup. 


What is its generator? 

16) Give an example of a subgroup of an Abelian group that is not a direct summand. 

17) Prove: The Sylow p-subgroup of an Abelian group must be unique. 

18) Find a maximal Abelian subgroup of S¢ and write it as a direct product of cyclic subgroup 
19) An Abelian p-group is cyclic iff it has precisely p-1 elements of order p. 

20) In a group of odd order, can an element be conjugate to its inverse? 


21) Prove that every element in S, is conjugate to its inverse. 


26) Construct a nonabelian group of order 27 and exponent 3. 

27) Prove that there is a group injection from any group of order n into GL(,F). 

28) Prove: If G has a subgroup H of index 2, with |H| prime, then G is either cyclic or dihedral 
29) What happens to exercise 28 when |H| is not prime? 

30) What is the centralizer of (1 2 ...n) in S, ? 

31) True or false: If G is a solvable group and exp(G)= |G|, then G is cyclic. 

32) Prove: If |Gl=p' for p prime, then G has a subgroup of order p* for each 1<k<t. 

33) What are the Abelian groups of order 2000, up to isomorphism? 

34) Prove that all groups of order 1994 are solvable. 

35) Prove that any group of order 1995 has a normal cyclic group of index 3. 


36) List all the groups of order 175, up to isomorphism. 


PART II — 
RINGS AND POLYNOMIALS 


Having examined groups (and monoids) so thoroughly, we are ready to 
study sets with two operations — addition and multiplication. Such sets 
satisfying certain basic axioms are called “rings,” and turn out to be a 
beautiful illustration of the principle of abstraction. We start with Z, one of 
the most familiar objects in mathematics, and write down its basic algebraic 
properties. A few extra properties define a certain kind of ring (PID) in 
which one can prove virtually all the arithmetic properties of Z, including 
“unique” factorization into prime numbers. BUt there is another importnat 
example of PID — the ring F[z] of polynomials with entries in a given field 
F. The theory of fields is closely coonnected to roots of polynomials (for 
examle V2 is a root of the polynomial x? — 2); furthermore, considering 
the ring F[z] as a whole (rather than limiting ourselves to one particular 
polynomial) yields surprising applications to field theory, to be considered 
in Part III. In other words, the process of abstraction enables us to transfer 
basic arithmetical properties of Z to F[a], which then yield applications to 
field extensions of F. We shall pause along the way to note other instances 
of PIDs, and see some fascinating applications in number theory, which 
historically inspired the discovery of ideals (which play such a crucial role 
in ring theory). Also we shall take note of related rings that share some of 
the important properties of PIDs. 
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CHAPTER 13. AN INTRODUCTION TO RINGS 


In this Chapter we lay out the foundations of rings, focusing on domains 
and (skew) fields; in the next, we complement the discussion with the theory 
of homomorphisms and ideals. 


Definition 1. A ringis a set R together with binary operations + and - and 
elements 0 and 1, such that 

(i) (R, +, 0) is a group. 

(ii) (R,-, 1) is a monoid. 

(iii) Distributivity of multiplication over addition holds on both 
sides, i.e., a(b+c) = ab+ac and (b+ c)a = ba+ ca. 

1 is called the unit element of R. The ring R is commutative if (R, -, 1) 

is a commutative monoid. 


Usually the operations + and - are understood. The set {0} is a ring 
(taking 1 = 0), called the trivial ring. We shall only consider nontrivial 
rings in the sequel. Examples include: 

LZ; 

Zm, for any positive number m, where + and - are taken modulo m; 

Q, R, C, and so on; clearly, any field is a ring. 

(Note that all these examples are commutative. For a noncommutative 
example, see Exercise 1.) 

Other than Z, the most significant example for us is the ring of poly- 
nomials F'[x], to be defined in Chapter 16, which will be seen to share many 
properties with Z. We turn to Z for intuition and start off with some easy 
facts for all a,b in an arbitrary ring R. 


LEMMA 2. (i) 0=0a=<a0 ; 
(ii) (—a)b = a(—b) = —(ab). 
Proof. (i) 0a = (0+ 0)a = 0a + 0a, so 0 = 0a; likewise 0 = a0. 


(ii) 0 = a0 = a(—b+b) = a(—b) +ab, implying a(—b) = —(ab); likewise 
(—a)b = —(ab). 


Lemma 2 implies that 1 4 0 in any (nontrivial) ring. Indeed if 1 = 0 
then any r in R satisfies r= rl =r0=0. 


Domains and Skew Fields 

A skew field (or division ring) isa ring D for which (D\{0}, -) isa group, i.e., 
every nonzero element d in D has a left and right inverse. In other words, 
a skew field satisfies all the axioms of a field except perhaps commutativity 
of multiplication. Although our principal interest lies in fields, we consider 
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this more general situation as a very brief introduction to noncommutative 
techniques, to be continued in Appendix B. 

A domain is a ring R for which (R \ {0}, -) is a monoid, i.e., if a,b 4 0, 
then ab £ 0. A commutative domain is called an integral domain, in analogy 
to Z, the ring of integers. 


Remark 3. If R is a domain, then the monoid (R \ {0}, -) is cancellative. 
(For if ab = ac then a(b— c) = 0 in R, implying b-— c= 0, so b =c.) 


Remark 4. Using Theorem 1.7 we see that any finite domain is a skew 
field and any finite integral domain is a field; in fact, a deeper theorem 
of Wedderburn (Theorem 15 in Appendix B) says any finite skew field is 
commutative, so we see that any finite domain is a field. Since our interest 
in this part lies in integral domains that are not fields, further discussion 
of finite rings is postponed until Chapter 24. 


A subring of a ring R is a subset containing 0,1, that is a ring under 
the given + and - of R. To verify that T is a subring of R we need only 
check that (T+) is subgroup of (R,+) and that (T,-,1) is a submonoid 
of (R,-,1), for distributivity in T is a direct consequence of distributivity 
in R. This observation is enhanced in exercise 4. 

Clearly, any subring of a commutative ring is commutative, and any 
subring of a domain is a domain, cf. Exercise 5. In particular, any subring 
of a field is an integral domain. 

On the other hand, the integral domain Z certainly is not a field, so the 
question arises as to how to check in general whether an (infinite) domain 
is a skew field. To this end, for any a in R define Ra = {ra: r € R}. Then 
we have 


PROPOSITION 5. R is a skew field iff Ra = R for alO Aa€ R. 


Proof. (=>) If a# 0, then r = (ra~')a € Ra for any r in R. 
(<) Ifa £0, then 1 € Ra implying a is left invertible; hence R \ {0} 
is a group, by Lemma 1.8. 


Let us dwell on this result a bit. 
Left Ideals 


Definition 6. A left ideal of a ring R is a subgroup L of (R,+) satisfying 
the extra property 


ra€ JL for allr in R and a in L. 


The left ideal L is called proper if L 4 R. Right ideals are defined symmet- 
rically, using ar instead of ra. 
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Remark 6’. The left ideal L is proper iff 1 ¢ L. (Indeed, if 1 € L then 
r=rl€L forallr in R.) 


Example 7. For any a in R, Ra is a left ideal of R, called the principal left 
ideal generated by a. (Indeed rja + rea = (ri + r2)a € Ra, proving Ra 
is a subgroup of (R,+); also r(ria) = (rrija € Ra.) Note a = la € Ra. 
Also, Ra is the smallest left ideal containing a; for if D is any left ideal 
containing a, then ra € L for each r in R, proving Ra C L. 


PROPOSITION 8. R is a skew field iff R has no proper nonzero left ideals. 


Proof. (=) If L £4 0 is a left ideal take 0 #4 a € L. Then L D Ra = R by 
Proposition 5, so L is improper. 

(<) For any nonzero a in R we have Ra 4 0, so Ra = R by hypoth- 
esis; hence R is a skew field by Proposition 5. 


Here are some facts about combining left ideals that we shall need 
repeatedly. 


PROPOSITION 9. Suppose Ly, [2 are left ideals of R. 

(i) L1 N Le is the largest left ideal contained in both Ly and Lz; 

(ii) Ly + Lo = {a, + ag : a; € L;} is the smallest left ideal containing 
both Ty and Ly. 


Proof. (i) Ly N Lz is an additive subgroup; if r € R and a € £1 La, then 
ra € Ly andra € Lg sora € Ly NL , proving L1 M Lz is a left ideal. 
Obviously any left ideal contained in ZL, and Lz is contained in Ly M Lo. 
(ii) LZ, + Le is an additive subgroup; if r € R and a; € L;, then 
r(a, + a2) = ra, +rag € Li + Le, proving L; + Lz is a left ideal. If Lisa 
left ideal containing both ZL; and L2, then for any a; € L; we see ay+azg € L 
so [1 + Lg CL. 


Remark 10. To illustrate Proposition 9, suppose L, = Ra and Lz = Rb are 
principal left ideals. Then c € L, + Lz iff c= ra-+sb for suitable r,s in R. 
For example, 7Z + 5Z = Z, since 1 = —2-7+ 3-5. In general, one should 
see without difficulty that mZ +nZ = dZ, where d = gcd(m, n). 


The union of two left ideals need not be a left ideal; e.g., 3Z U 2Z 
contains 3 and 2 but not 3 — 2 =1. Nevertheless, there is a positive result 
along these lines. 


Remark 11. If L; C Lg C L3 ©... are left ideals of R, then so is Us>1 Li. 
(For if a € UL; then a € L, for some i implying ra € LD; C UL; for 
each r in R; likewise if a,b € UL; then a,b € DL, for some i, implying 
atbe I, Cc UL;.) 
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Exercises 


Rings of Matrices 


dl 


10. 


For any field F and any n > 1, the set of n x n matrices M,,(F) 
is a ring, under usual matrix multiplication and addition. More 
generally, let R be any ring and define M,,(R) to be the set of 
n X n matrices with entries in R, endowed with the usual matrix 
addition and multiplication. Show that this is a ring. (Hint: The 
easiest way to manage the computations is by using matric units, 
cf. Chapter 0.) Show that M,,(R) is neither commutative nor a 
domain, for n > 2. 


. Anelement e of aring R is called idempotent if e? = e. (For example, 


0,1 are idempotents, and these are the only idempotents if R is a 
domain.) Show that if e € R is idempotent, then eRe is a ring 
whose unit element is e. 

The set of continuous functions from R to R is a ring. Likewise for 
differentiable functions, and so on. 

Recall that a sentence in elementary logic is called universal if its 
normal form involves only the quantifier V. Prove that any universal 
sentence holding in a ring also holds in all of its subsets. In partic- 
ular, this holds for associativity and distributivity. Conclude that 
an additive subgroup FR of a ring T containing 1 of T and closed 
under the multiplication of T is necessarily a subring. On the other 
hand, give an example of a subset of Z, closed under addition and 
multiplication and containing 0 and 1, which is not a subring. 
Show that the following properties for a ring pass to subrings: being 
commutative; being a domain. 


Direct product of rings 

Define the direct product R of rings Ri,...,R, to be the carte- 
sian product R, x --- x R;, operations defined componentwise, and 
prove it is aring. If e; € R; are idempotents, then (e1,...,e¢) is 
idempotent in R; in particular, any element (0,...,0,1,0,...,0) is 
idempotent. Can R be an integral domain? 

The direct product of commutative rings is a commutative ring. 
Any left ideal of R, x Re has the form A; x Ag where A; is a left 
ideal of R; for i = 1,2. Similarly for right ideals. (Hint: Use the 
idempotents (1,0) and (0, 1).) 

Zinn © Lm X Zp as rings, if m,n are relatively prime. 

Unit(R, x--- x Ry) © Unit(R1) x--- x Unit(R,) as groups, for any 
rings Ri,..., Ry. (Hint: View R; as monoids.) 
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11. Use Exercises 9 and 10 to rederive Exercise 6.7. 

12. Generalizing Proposition 9, define the sum of an arbitrary number 
of left ideals L; : 1 € I to be the set of finite sums of elements taken 
from various L;; show that this is the smallest left ideal containing 
each [;. What is the largest left ideal contained in each L;? 


CHAPTER 14. THE STRUCTURE THEORY OF RINGS 


Although our emphasis in group theory was on finite groups, our attention 
here is mainly on infinite rings, because many of the rings of greatest interest 
to us are infinite, such as Z, Q, R , and C. (Finite fields also are of 
interest, and will be classified in Chapter 24.) However, the classification 
problem for infinite rings becomes so formidable as to be nigh impossible. 
In providing manageable partial results, the structure theory assumes an 
even more important role than before and pervades ring theory. After a 
while the structure theory takes on a form of its own, guiding us to the most 
profitable avenues of inquiry. Availing ourselves of the structure theory of 
groups, we can make use of the fact that every ring is an Abelian group 
(under +), and the transition to ring theory is easy. Thus, at the outset, 
we pass through the following stages: 
(1) homomorphism, 
) kernel of homomorphism © ideals (analogous to normal subgroups), 


4) structure of R/A in terms of R, and 


(2 
(3) residue ring = factor ring R/A where A is an ideal of R, 
( 
(5) Noether isomorphism theorems. 


We shall also give the ring-theoretic analog of Cayley’s theorem. 


Definition 1. A ring homomorphism yp: R > T is a group homomorphism 
(under +) that is also a monoid homomorphism (under - ). A ring isomor- 
phism is a ring homomorphism that is 1:1 and onto. 


Remark 2. By Remark 4.2, it suffices to check the following properties for 
all a,b in R, in order to verify that gy: R > T is a ring homomorphism: 
(i) plat b) = g(a) + v(b); 
(ii) p(ab) = pla)p(>); 
(ii) g(1) = 1. 
Condition (iii) is not redundant, by the example in Digression 4.3. 
Much of the structure theory can be obtained readily from the theory of 
Abelian groups, since ring homomorphisms are also group homomorphisms 
(of the additive structure). Thus, we have 
Remark 2. If g:R > T is a ring isomorphism, then ¢7!:T > R is also 
a ring isomorphism. Indeed, by group theory g~+ is an isomorphism from 
(T,+) to (R,+), so we need only check conditions (ii) and (iti) for got. 
(iii) is obvious; to check (ii) for a,6 in T we let u = g~+(a) and v = gy 1(b) 
and note that 


py *(ab) = g *(y(u)e(v)) = g*y(uv) = ww = gp (ale *(8). 
Typeset by A,yS-TEX 
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Another instance of this philosophy: 


Remark 3. If g:R > T is a ring homomorphism, then y(R) is a subring 
of T. (Indeed, we already know y() is an additive subgroup; furthermore, 
1p = p(1p) and y(ri) (72) = v(rir2) for r; in R.) 


Now we turn to ring homomorphisms that need not be isomorphisms. 
Again drawing from group theory, we define the kernel of y, denoted ker y, 
to be 7 1(0), and recall that y is 1:1 iff kerg@ = 0. Furthermore, ker g is a 
subgroup of (R,+), and is a left and right ideal since for any a € ker y and 
rin R we have 


pra) = v(r)p(a) = v(r)0 = 0 
Oy(r) = 0. 


lI 
6 
— 

Q 
ee 
6 
— 

3 
oe 

ll 


and (ar) 


As with groups, a 1:1 ring homomorphism y: R > T is called an injection 
(of rings), and enables us to view R as a subring of 7. 


Ideals 
Let us now describe the structural properties of the kernel of a ring homo- 
morphism and define the ring-theoretic analog of a normal subgroup. 


Definition 4. An ideal A of R is a left and right ideal. 


Clearly, 0 and R are ideals of R, called the trivial ideals. Thus, “non- 
trivial” means “proper nonzero.” Any ideal of R containing 1 is all of R, by 
Remark 13.6’. Hence a proper ideal cannot be a subring, thereby breaking 
the analogy with groups. This failure has caused several authors (most 
notably Herstein) to discard the unit element 1 from the definition of a 
ring (since then every ideal would be a subring). However, this complicates 
the theory in several key places such as Remark 13.6’ itself, and the stan- 
dard practice has come to require 1 € R in the definition. Of course, in a 
commutative ring, any left ideal is an ideal. Thus we have 


Note 4'. Proposition 13.8 says a commutative ring R is a field iff R has no 
nontrivial ideals. 


Let us consider this result more carefully, especially in view of Exer- 
cise 2, which shows that M,(F) is not a skew field (and even has zero- 
divisors) whenever n > 1, but has no nontrivial ideals, for any field F. If 
we scratch beneath the surface, we uncover one of the great dilemmas of 
noncommutative ring theory: Although the structure theory of rings in- 
volves ideals, an important role is also played by left ideals, and left ideals 
often are more manageable than ideals. (Exercise 1 indicates the difficulty 


of computing with ideals.) The way this dilemma was resolved in Note 4’ 
was that we worked with commutative rings, in which left ideals and ideals 
are the same. In general the theory of commutative rings is much more 
accessible than the theory of noncommutative rings, so after this Chapter 
we shall concentrate on commutative rings, postponing the treatment of 
noncommutative rings until Appendix B. 

Since our study of commutative rings will hinge on their ideals, the 
case of fields is in some sense “trivial;” we shall be interested here in rings 
which are not fields. In analogy to normal subgroups, we write AR to 
denote that A is a proper ideal of R. 


Remark 4". Many assertions for left ideals also hold analogously for right 
ideals and thus for ideals. For example, the sum A+ B of two ideals A, B is 
an ideal, and is the smallest ideal containing A and B (cf. Proposition 13.9). 
Likewise the intersection of ideals is the largest ideal contained in each of 
them. In general the union of ideals is not an ideal, but if Ay C Ao C... 
are ideals of R, then U,., Ai «R, by Remark 13.11. 


Our next task is to show that every ideal is indeed the kernel of a 
suitable ring homomorphism. 


Definition 5. If As R, define R/A = {r +A: 7 € R}, which is given the 


additive group structure of the residue group and multiplication 


(ry + A)(T2 + A) =Srytrot A. 


THEOREM 6. Let Ad R. Then R/A as defined above is a ring, with unit 
element 1+ A. There is an onto homomorphism y:R > R/A given by 
p(r) =r + A, and ker yp = A. 


Proof. Multiplication in R/A is well-defined, since if r, + A= rj + A and 
ro + A=r), 4+ A, then writing ri =r; + 4; for a; in A we see 


nits t+ A=(ri +41)(r2 +02) +A 
=7ryrot+ (71a tayro+ 4103) + A= T1To + A. 


Associativity and distributivity are easy to verify in R/A, as a conse- 
quence of the respective axioms in R. Moreover, 


(1+ A)(r+ A) =r+ A= (r+ A)(14 A), 


so 1+ Ais the unit element of R/A. We already know ¢ is a group homo- 
morphism with respect to +, and p(1) =1+ A, and 


p(ri)p(t2) = (1 + A)(t2 + A) = 7172 +A = (T1792), 


proving y is a ring homomorphism. From group theory, gy is onto and 
kerrp=y 1(0)=A.0 


Here is a useful result, which transfers the ideal structure from R 


to R/A. 


PROPOSITION 7. Let AdR. If A C B«R then B/AsR/A. Conversely, every 
ideal of R/A can be written uniquely in the form B/A, where AC B«R. 


Proof. We know B/A is an additive subgroup of R/A, and we check that 
(r+A)(b+A) = rb+A € B/Aand (b+A)(7+A) € B/A for all bin B andr in 
R. Conversely, if 14R/A we know from group theory that J = B/A, where 
B is uniquely determined as the preimage of J in R, and B is an additive 
subgroup of R. But ifb+A € J andr € R, then rb+A = (r+A)(b+ A) € I, 
and likewise br + A € I, implying rb € B and br € B,so BAR. O 


This has an immediate consequence. Let us say a proper ideal A is a 
maximal ideal (in R) if no proper ideal strictly contains A. For example, 
3Z is a maximal ideal of Z. 


COROLLARY 8. Suppose R is a commutative ring and A<R. A is a maximal 
ideal iff R/A is a field. 


Proof. Apply Note 4' to Proposition 7. 0 


Noether’s Isomorphism Theorems 
Let us turn now to ring-theoretic analogs of the Noether isomorphism the- 
orems for groups. 


LEMMA 9. Suppose y: R > T is any ring homomorphism, and A< R with 
A C kery. Then there is a ring homomorphism ¢:R/A > T given by 


P(r + A) = v(r), and ker g = (ker y)/A. 
Proof. @ is an additive group homomorphism with the correct kernel, by 


Lemma 5.14; in fact, ¢ is a ring homomorphism, since 


P((71 + A)(r2 + A)) = G(rita + A) = (rita) = (ri) e(r2) 
= @(r1 + A)G(r2 +A). O 


THEOREM 10. NOETHER I. If gy: R > T is an onto ring homomorphism 
then T = R/ ker ¢. 


Proof. Asin Theorem 5.16. 0 


Example 11. Z,* Z/nZ as rings (compare with Example 5.16’). 


THEOREM 12. NoETHER II. R/B x (R/A)/(B/A), for any ideals A C B 


of R. 


Proof. Asin Theorem 5.17. 0 


Noether III does not carry over nicely to rings, since an ideal of a ring 


is not a subring. 


Exercises 


Li: 


Recall that the smallest left ideal containing a is Ra. Show that the 
smallest ideal containing a is ‘aaa ras; 1k EN, 17,5; € Rh, a 
much more complicated object. 


. Any proper ideal of M,(R) has the form M,,(A) where A<R. In 


particular, if D is a skew field, then M,,(D) has no nontrivial ideals. 
On the other hand, M,,(D) has nontrivial left ideals for n > 1. 


The Regular Representation 


. Suppose ($,+) is an Abelian group. Then, replacing Map(S;, S) 


(of Exercise 9.9) by the set of group homomorphisms from S$ to 
S, denoted as Hom(S,5), show that Hom(S,S) is a submonoid of 
Map(S, $). In fact, Hom(S, S) is a ring, where addition is given by 
(f + 9)(s) = f(s) + 9(s). 


. If S is a ring then there is a ring injection S > Hom(S,S) given 


by ar £,, where (4:5 3 Sis “left multiplication by a,” given 
by £.(s) = as. This injection is called the left regular representation 
(cf. also Exercise 21.15). Note the parallel to Cayley’s theorem from 
group theory. If we took the right multiplication map rq instead of 
the left multiplication map £,, we would have an anti-injection, in 
the sense that multiplication would be reversed, 7.€., Tap = ToTa- 


. For any group ($,+), any additive subgroup of Hom(S, S$) closed 


under composition of maps and containing the identity map S > S 
is a subring, by Exercise 13.4. 

As with Cayley’s theorem, one uses Exercise 13.5 to prove that 
a given additive group S is indeed a ring, by displaying it as a 
subring of Hom(S, S$), cf. Exercise 16.3. 


General Structure Theory 


. If g: R > T is a ring homomorphism, with R commutative, then 


p(R) is a commutative ring. 


. State and prove that the analogs of Proposition 13.9 and Remark 


13.11, for ideals instead of left ideals. 


. (Abstract Chinese Remainder Theorem) Ideals Ay,..., Az are called 


comaximalin Rif A; + A; = R for all 1 ¥ 7. In this case prove 


t 
R/(\ Ai R/Ay x +++ x R/Ap. 


i=1 


(Hint: Define g: R > R/A,x---x R/A; by re (r+Aq,....7+ At). 

One needs ¢ onto, or, equivalently, each (0,...,0,1,0,...,0) € y(R). 
Fixing 2, write 1 = a;; + 4; for aj; € Aj, a; € Aj. For convenience 

take 7 = 1. Then 1 = [J,7,(a1; + aj) = a+ a2... a4 for suitable 

a in Ay; consequently y(a,...a4) = (1,0,...,0).) 

. (Chinese Remainder Theorem) Using Exercise 8, show that for 

any relatively prime numbers n,,...,+ and any integers a,,..., a+, 

there is a € Z such that a= a; (mod n;), 1<i<t. 


CHAPTER 15. THE FIELD OF FRACTIONS — A STUDY IN GENERALIZATION 


We have seen that any subring of a field is an integral domain, and every 
finite integral domain is a field. On the other hand, although Z is an integral 
domain that is not a field, Z is a subring of the field Q. This leads us to 
ask whether every integral domain need be a subring of a suitable field. 
Actually, the standard construction of Q from Z can be modified to yield 
a positive response to this query. 


Construction 1. The field of fractions. Let R be any integral domain, 
and S = R\ {0}. Define an equivalence relation ~ on R x S by “cross- 
multiplication,” 7.e., 
(71,81) ~ (72,82) if 118. = 1981. 
Reflexivity and symmetry of this relation are apparent; to verify transitivity 
suppose that r,s. = 728, and 7283 = 7382; we need to check 7183 = 738}. 
But 
(1183) 82 = (7182 )83 = 128183 = 128381 = 738251 = 738182, 

80 7183 = 7381, seen by canceling sz on the right. Now write = for the 
equivalence class of (r,s) in R x S, and define R[S~*] (also denoted S-'R 


and Rg in the literature) to be the set of equivalence classes £, where r € R 
ands € S. 


Remark 2. 22 = x for all s in S. In particular, 2 = ¢ (taking r’ = 0, 


s' =1), and * = + (taking r! = s’ = 1). 
Next, we define operations 
Tay 4 12 es 1189 +1281 
81 82 8182 
ryt rr 
and ee 
81 89 8189 
(These are both in R[S~!] since (S,-) is a monoid.) Let us show that these 


operations are well-defined. Suppose 


/ I 

r r r r 

1 1 2 2 
—-=— and ~=—; 
Sy 81 8» 82 

then r}s; = 11s} and ris. = 128), so 

tot rot ed AY ' ' ' _ rot rot 

(1185 +7584 )$182 = 711818582 +15 828181 = 11818989 + 72858181 


= (7182 + 7281)84 85, 


since (17415 )$182 = 14811582 = 11841285 = (T172)8) $9. 
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Proposition 3. R[S~'] is a ring, and there is a natural ring injection 
:R— R[S~] given by (r) = ©. 


Proof. Associativity (both of addition and of multiplication) and distribu- 


tivity of multiplication over addition are checked easily, and we see at once 


r 


that ¢ is the 0 element and + is the unit element. Finally — = —2, since 


r —r_ 0 _4d0 7 - . 
24 = 4 = 7, by Remark 2. ~ is a homomorphism since 


rtr 


wrt) = PEP TT Lwin) 4 wer), 
worn!) = aE = (nye) 
w= 7 


Finally, r € ker a iff | = 4, iff r-1=0-1=0, implying kery) = 0. O 


PRoposITION 4. With notation as above, R[S~'] is actually a field. 


Proof. Suppose = # 4. Then r # 0 (by Remark 2), so r € S, implying 


£ € R[S~"), and, clearly, (£)(4) = = =+.0 


r rs 1 


We separated Propositions 3 and 4, since Proposition 3 soon will be 
generalized. 


Intermediate Rings 

Thus far, we have succeeded in finding an injection from an arbitrary in- 
tegral domain R to the field R[S~'], where S = R\ {0}. The reader may 
have noticed however that in the proof of Proposition 3 we have used only 
the following properties of S: 


1. The elements of S commute with all elements of R; 
2. Sis a multiplicative submonoid of R; 
3. Cancellation works in R for elements of S' (needed for transitivity). 


Thus, we can reformulate Proposition 3 as 


PROPOSITION 6. Let R be any commutative ring, and let S be asubmonoid 
such that rs # 0 for every 0 #r € R ands € §. Then R[S~+] as con- 
structed above is a ring, and there is a natural injection ~):R > R[S~1) 
given by rt ¢. 


See exercise 3 for a further generalization. 


Remark 6'. Given A C R, let us write AS”! for {4: a € A, s € S}. If 
AR, then clearly AS”! <« R[S~*]. The converse to this assertion is given 
in Exercise 10. 


Now we come to a rather delicate issue. Suppose that we already have 
a ring containing R, in which every element of S is invertible. How does 
this compare with the abstract construction R[S~']? For example, there 
are various ways of viewing Q. How do we compare these with the abstract 
construction? The answer to this question takes us to the key property of 


R[S@?]. 


PROPOSITION 7. Suppose R,T are commutative rings, and S C R is as 
in Proposition 6; suppose y:R > T is a homomorphism of commutative 
rings, such that p(s) is invertible in T for every s in S. Then ~p extends to 
a homomorphism ¢: R[S~'] > T given by 


A(E) = elr)p(s). 


and ker @ = (ker g) St. 


Proof. 
afl mt = 
— =) = o(ri)e(s1)! + g(r2)(s2) 
81 82 
= (y(71) (82) + v(r2)(s1))((s1) (82) *) 
a ees t 
= p(riss +1281) 9(s182) ' = @(— + =); 
Ss 82 
srl _ 
Az) = eed) =1: 
shi eek 7 7 
e a e = = (11) (81) 'p(r2)p(s2) 
Tit 
= p(rits)p(s1s2) | = g(——): 
81 82 
proving ¢ is a homomorphism. Now * € ker@ iff y(r)y(s)~! = 0, iff 


p(r) = 0, so ker ¢ = (kery)S 1. O 


COROLLARY 8. Notation as in Proposition 6, if R is a subring of T, then 
R[S~"] is canonically isomorphic to {rs~+:r € R, s € S} taken in the 
ring T. 


Proof. Let p:R — T be the natural injection. Then, with notation as 
in Proposition 7, ¢:R[S~'] > T is also an injection, and thus R[S~'] is 
isomorphic to its image in T, which is {rs~!:r € R,s € S}.0 

Example 9. Any subring of Q contains Z, since n = 1+---+ 1, taken n 


times. Thus, any subfield of Q contains the field of fractions of Z, which 
can be identified naturally with Q, so the only subfield of Q is Q itself. 


Next one could ask, “Is there a ring W such that ZC W C Q ?” To 
answer this, we can try W = Z[S~+] when S$ is any submonoid of Z, for we 
can view W C Q by Corollary 8. Here is one instance when W lies properly 
between Z and Q. 


Example 10. Take s > 1 in N and let S = {s* : i € N}. Write Z[4] for 
Z[S~']. Then Z C Z[z] since s~1 ¢ Z. Also if s|t, then Z[2] C Z[4], and 
the inclusion is certainly proper if t does not divide any power of s (for if 
1 


+ = 4% then s' = tn). In particular, we have an infinite chain 


Zz c 25] C zi- c Zls71 c Zs] Gin: 


These results are examined further in Exercises 4ff. 


Exercises 


1. Write the details of the proofs of associativity and distributivity for 
Proposition 3. 

2. ZIN\ {0} & Z[Z\ {0} @ 

3. Suppose S$ is an arbitrary submonoid of a commutative ring R. 
Define ~ on RX S by (71,81) ~ (72, 82) if 71898 = 72818 for some s in 
S. This is an equivalence relation, and generalizing the construction 
in the text one can build R[S~+] together with a homomorphism 
R — R[S~"] whose kernel is {r € R : rs = 0 for some s in S}. 
(In particular, if 0 € S then R[S~1] = 0.) Prove the analog of 
Corollary 8. 


Subrings of Q 

4. If pis a prime number then there is no ring W properly contained 
between Z and Z{-]- 

5. For any prime number p define P = pZ and Zp = {@ : p does not 
divide n}. Write this in the form Z[S~"], for a suitable submonoid 
S of N. 

6. With notation as in Exercise 5, show that Zp has the unique max- 
imal ideal pZ p = {= : p does not divide n}. 

7. Every subring of Q has the form Z[S~"] for a suitable submonoid 
S of Z. (Hint: Arguing as in Exercise 4, show that S' is the set of 
denominators of fractions expressed in lowest terms.) 

8. Given any finite set So of distinct prime numbers {p1,...p:}, let $ 
be the submonoid of N “generated” by So; i.e., S is comprised of 
those natural numbers of the form {p7"!...p7?"t : pi; € So, m; € N}. 
Prove Z[S~'] = Z[4] where n =p, ... pe. 


11. 


12. 


Every subring of Q can be obtained by taking a suitable set So 
of distinct prime numbers (not necessarily finite), letting S be the 
submonoid of N “generated” by So, and forming Z[S~']. (Hint: 
Start with Exercise 7.) 


. Assumptions as in Proposition 6. Every ideal of R[S~1] has the 


form AS~! for suitable AR. 

Weaken the hypothesis in Proposition 7 to R commutative and S a 
submonoid of R, cf. Exercise 14.6 

Propositions 6 and 7 and Corollary 8 remain valid, if we replace 
the condition R commutative by “sr = rs for all r in R, s in S.” 
Furthermore, under this condition, one need not assume that T is 
commutative. (There is a further weakening of this condition for 
noncommutative rings, which is outside the scope of these notes.) 


CHAPTER 16. POLYNOMIALS AND EUCLIDEAN DOMAINS 


As indicated in the introduction to Part II, some of the most beautiful re- 
sults of algebra involve polynomials. Upon first acquaintance, a polynomial 
seems rather ethereal — a formal expression involving an indeterminate z 
and coefficients from a given ring R — and one might expect that we must 
substitute some value for z in order to obtain meaningful results. However, 
it turns out that the collection of all these polynomials can be given the 
structure of a ring having many nice properties which are inherited from R, 
and in this sublime transition from chaos to algebraic structure, « becomes 
a very meaningful element of the new ring. 

The polynomial ring over a field shares several key properties with Z, 
one of which is the “Euclidean algorithm,” which enables us to divide one 
element into another and check whether the remainder is 0. This property 
is so important that we use it to define a class of rings, called Euclidean 
domains, our first common generalization of polynomial rings and of Z; we 
conclude this section by developing the basic properties of number theory 
in these rings. 

To start with, we must settle on the “correct” definition of polynomial 
with coefficients in a ring R. In all innocence one might write down a 
typical polynomial, such as 22° + 32 +7, but we must be careful, since 
then Ox? + 223 + 32 +7 certainly should be considered to be the “same” 
polynomial. To avoid complications involving equivalence relations, we 
define instead a polynomial (over R ) to be a formal infinite sum >) <1 aj", 
where each a; € R and almost all a; = 0. The nonzero a; are called the 
coefficients of the polynomial. Thus, 27° + 32 +7 is considered shorthand 
notation for en a,z' where a) = 7, a, = 3, d2 = 0, a3 = 2, and a; = 0 
for all 7 > 4. 


The Ring of Polynomials 
Write R[«] for the set of polynomials over R. Defining addition com- 
ponentwise, i.e., > ajx* + > byx? = So(a; + 6;)2", we have 


Remark 1. (R[az],+,0) is an Abelian group, where 
0= S. Oz' and — o> ax") = So (-ai)a". 


It remains to define multiplication of polynomials. Clearly, we want 
zx) = 2’*), and thus 


o> aja" )()> bjat) = oa aybja'*4, 


tJ 
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However, the expression on the right is not written as a polynomial, 
since each power of x does not have a unique coefficient; instead we note 


uu 
Saibja — OS Ajby—i)2", 
ag wu  i=0 


and thus define formally 


(So ain) (So 5j07) = Yee”, (1) 


a,b; 18 calculated in R. 


where ¢, = >.” 


i=0 
PROPOSITION 2. Suppose R is a ring. Then R[z] is a ring under the given 
addition and multiplication, and is commutative if R is commutative. 


Proof. The ring axioms are routine to check, where the unit element is 
1+ 02+ 02? +...; associativity of multiplication is the most cumbersome 
and is left to the reader, cf. Exercise 1. More streamlined approaches are 
given in Exercises 2 and 3. 0 


Digression. One might wonder what would happen if we dropped the con- 
dition for 37; a,x’ that almost all the a; are 0, i.e., if we dealt with “power 
series” instead of polynomials. Surprisingly, one still winds up with a ring, 
which is very useful and is described in Exercises 7ff. 


Now that we have defined R[z], let us study some of its properties as 
aring. The key is the degree of a nonzero polynomial 5+ a,x", defined to be 
the largest number u for which a,, 4 0; a, is called the leading coefficient. 
We say f is monic if its leading coefficient is 1. The degree of a polynomial 
f is denoted as deg f. For example, if f = 22° + 32 + 8, then deg f = 3. 
A polynomial of degree 0 is called a constant polynomial. 


Remark 3. There is an injection R > R[z] given by ri r+0r+027+...; 
in this way we can identify R with the constant polynomials. 


Remark 4. Suppose R is a domain, and 0 # f,g € R[a]. A glance at (1) 
shows fg # 0, and 
deg(fg) = deg f + deg g. (2) 


Coro.tiary 5. If R is a domain, then R[x] is also a domain. 


Example 6. Z[a]is an integral domain. F[s] is an integral domain, for any 


field F. 


Now recall the Euclidean algorithm for Z (which says that for any 
numbers a,b # 0 there are integers g,r (for “quotient” and “remainder” 
respectively) with 0 <r < 6, such that a = bq +r. 


PROPOSITION 7 (EUCLIDEAN ALGORITHM FOR POLYNOMIALS). Suppose 
R is any ring, and f,g € Rx], with g monic. Then there are polynomials 
q,7 in R[x] such that f = qqg4+-r, where cither r = 0 or degr < deg g. 


The proof is obtained by examining division for polynomials, learned 
(without proof) in high school, as exemplified by 
227 + 62 +21 

x —3 )293 +02? + 32 +7 
22° — 62? 
6x? + 3x 
627 — 18% 
2la2+7 
212 — 63 
70 
where here f = 22° +3247 and g=2z—3. 


Proof of Proposition 7. Induction on n, where n = deg f and m = deg g. 
If n < m, then we are done, by taking g = 0 and r = f, so assume n > m. 
Let h = f — az” g, where a is the leading coefficient of f. The leading 
terms of f and az"~™g are each ax”, and thus cancel each other in h, so 
deg h <n. By induction on n we can write h = qg+r where degr < deg g; 
hence 
f=h+an™ = (qtar" ™)g +r, 

which has the desired form. 0 


Euclidean Domains 

Having been introduced to the Euclidean algorithm for polynomials, we are 
ready for a major breakthrough — namely, we use the Euclidean algorithm 
to define a common generalization of Z and F[z]. Then, proving the basic 
theorems of arithmetic (well-known for Z) in this more general setting, we 
shall have them instantly for F[z]. 


Definition 8 A Euclidean domain is an integral domain FR together with a 
“degree function” d: R \ {0} > N satisfying: 
(Eucl) d(a) < d(ba) for any a,b in R \ {0}: 
(Euc2) for any a, in R\ {0} there are q,r in R such that 
b=aq+tr, 
with either r = 0 or d(r) < d(a). 


Example 9. (i) Z is Euclidean, when we take d(a) to be |a|; indeed, (2) is 
the familiar Euclidean algorithm for integers, and (1) follows from the fact 
that |b| > 1 for any integer 4 0 (for then |ab| = |a||b| > lal). 


(ii) F[z] is Euclidean for any field F, where d( ) is the degree of 
the polynomial. Indeed, we have seen that F[z] is an integral domain. 
d(fg) = d(f)+d(g) > d(f) for any polynomial g # 0, yielding (Eucl); 
(Euc2) is the Euclidean algorithm for polynomials. (Since F is a field, 
we may divide through by the leading coefficient of the polynomial g and 
assume g is monic.) 


The case r = 0 leads us to the following basic notion. 


Definition 10. a “divides” b, written alb, if b = ga for some g in R, or, 
equivalently, if b € Ra. We say a is an associate of b if Ra = Rb, 1.e., if 
a|b and b|a. 


Remark 10’. In an integral domain R, if a and b are associates then a = ub 
for some invertible «in R. (Indeed, write a = ub and 6 = va. Then 
a= u(va), implying uv = 1.) 


Remark 11. If A= Ra in a Euclidean domain R, then d(b) > d(a) for every 
046€ A. (Indeed, write b = qa for q € R, and apply (Euc1).) 


Consequently, if Ra = Rb, then d(a) = d(b). The converse also holds. 


PROPOSITION 12. Suppose R is a Euclidean domain with degree function d, 
and Ad Rk. If a has minimal degree of all nonzero elements in A, then 


A- Ra. 


Proof. For any 0 #6 € A, write b = qa+~r for r = 0 or d(r) < d(a). But 
ry = b—gqa € A since 6 € A and ga € A, so we cannot have d(r) < d(a). 
Hence r = 0.0 


Corotiary 13. Ifb € Ra and d(a) = d(b) then Ra = Rb. 


Our next corollary is the key to the theory, since it gives the connection 
between the degree function (which is rather far removed from the abstract 
notion of ring) and the ideals of the ring. In fact, one has 


COROLLARY 14. In any Euclidean domain R, if A<« R, then A = Ra for 
suitable a in R. 


Proof. Take 0 4 a € A of minimal possible degree. 0 


Let us turn to the building blocks of arithmetic. The invertible ele- 
ments are “too good,” so we usually disregard them. 


Unique Factorization 


Definition 15. A noninvertible element p 4 0 of a ring R is trreducible if 
whenever p = ab we have a or 6 invertible. 


Our first goal is to obtain the factorization of an arbitrary element into 
irreducibles. 


PROPOSITION 16. In a Euclidean domain, any noninvertible element r ~ 0 
can be factored into irreducibles. 


Proof. By induction on d(r). The result is obvious unless r is reducible, 
i.e., r = ab for a,b not associates of 7. By Corollary 13, d(a) < d(r) 
and d(b) < d(7), so, by induction, each of a and 6 has a factorization into 
irreducibles; putting these together yields the desired factorization for r. 0 


Now we turn to uniqueness of the factorization. We say two factor- 
izations into irreducibles, r = p,...p, and r = qy...qm, are equivalent if 
m = k and there is a permutation 7 of {1,...,4} such that g,; and p; 
are associates for 1 <i< k. The element r has unique factorization if all 
factorizations of 7 into irreducibles are equivalent. 

Since “uniqueness” is only up to associates, it would make more sense 
to deal with the monoid of equivalence classes of elements (under asso- 
ciates), which, in fact, is what we do in practice, although we rarely admit 
it in public. (For example, we normally deal with Z* instead of Z, taking 
the positive prime number p as the representative of the class {p}.) Thus, 
when we talk of “the” irreducible p in a factorization we really are referring 
to its class of associates. To analyze the uniqueness of factorizations, we 
need a new concept. 


Definition 17. A noninvertible element p € R is prime if it satisfies the 
following property: If p|ab then pla or plo. 


Remark 18. In an integral domain, any nonzero prime p is irreducible. 
(Indeed, suppose p = ab; then alp and blp. But also plab, implying pla or 
p|b, so a or b is an associate of p.) 


The point of this definition is seen in the next lemma. 


LEMMA 19. If a prime p divides r,...T,, then p divides some r;; if fur- 
thermore r; is prime, then p and 7; are associates. 


Proof. By definition p|ry ...7m—1 or p|?m, so the first assertion follows by 
induction on m. The second assertion is clear, by Remark 18. 0 


PROPOSITION 20. In an integral domain, all factorizations of a given ele- 
ment r into primes are equivalent. 


Proof. Induction on k, where we write r = p,...p; with each p; prime. 
Suppose r = q@ ...Gm with each qj prime. Then p;, divides some q;; since 


primes are irreducible, we see p; is an associate of q;. Writing qj = up; we 


have 

Pi. -Pk—-1 = UG... Gj—-19j41--- Im: (3) 
Noting wq, is prime, we apply induction on & to see that the factorizations 
in (3) are equivalent: thus k— 1 = m—1, and uqi.q2,.--.4j—1, Qjtis-+ ++ 4m 
is a permutation of associates of p,,...,pz—1. Then we are done, since qy 


and wq, are associates. O 


To obtain unique factorization in Euclidean domains, it remains to 
show that every irreducible is prime. One way is to translate the definitions 
of “irreducible” and “prime” to the structure of rings. We say a divisor a 
of b is proper if a is not an associate of b. 


Remark 21. (i) An element p is irreducible, iff every proper divisor of p is 
necessarily invertible. In other words, p is irreducible iff whenever Rp C Ra 
one necessarily has Ra = R. 

(ii) An element p is prime iff whenever ab € Rp we have a € Rp or 
6b € Rp (1.e., iff R/ Rp is an integral domain). 


PROPOSITION 22. In a Euclidean domain R, any irreducible element p is 
prime. 


Proof. Otherwise R/Rp is not an integral domain, and thus not a field, so 
by Proposition 14.7 R/Rp has a proper ideal A/Rp # 0, i.e... Rp C AAR. 
But then A has the form Ra by Proposition 12, contrary to Remark 21(i).0 


Let us put everything together. 


Definition 23. A unique factorization domain (UFD) is an integral domain 
for which each noninvertible element has unique factorization. 


THEOREM 24. Every Euclidean domain is a UFD. 
Proof. Combine Propositions 16, 20, and 22. 0 


It is easy to see that unique factorization is the key to many results 
about number theory. For example, we can compute the greatest common 
divisor of a and 6 as the product of those irreducibles (counting multiplicity) 
that are common to the factorizations of a and b: a and 6 are relatively prime 
precisely when they have no common irreducible divisors. The following 
observation is critical: 


Remark 25. Ina UFD, every irreducible element is prime. (Indeed, we want 
to show that a given irreducible p is prime. Suppose p divides ab. Then 
p appears in the factorization of ab into irreducibles. But by uniqueness this 


is the product of the respective factorizations of a and b into irreducibles; 
hence p (or an associate) appears in one of these factorizations, and so 
divides a or b.) 


This remark can be strengthened to characterize UFDs cf. Exercise 
21. An application: 


Example 26. Suppose ab = c” in a UFD, where a and b are relatively prime. 
Then a = uc? and b =u ‘c}, where w is invertible and ¢,,¢2 are relatively 
prime. (Proof: Any prime p dividing @ must appear with multiplicity 
divisible by n, since p divides c” and does not appear in the factorization of 
b. Thus a and 6 are associates of nth powers, i.e., a = uc} and b = ve} with 


”~ and comparing factorizations 


u,v invertible. Hence c” = ab = uv(cre2) 
we see c and cic. are associates. Write ¢ = weyc. where w is invertible; 


replacing c, by we, we may assume w = 1. But then wy =1, s0ov=u7?.) 


Exercises 


1. Verify the ring axioms for R[x]. (Hint: The hardest verification is 
associativity of multiplication, which follows from 


uu 


(do aia)(S2 b;27))\() cea*) = OID 5 aidud2")( 5 x2") 


u  i=0 


= (DIE DV aibaseret 


v u=0 i=0 


2. (Another way of writing polynomials) Write the infinite vector (a;) 
for the polynomial >> a;7;. Then the ring operations in R[a] become 
(a;) + (b;) = (a; +b;) and (a;)(b;) = (cu), where ey = Oj-4 Gibu—i. 

3. Prove slickly that R[z] is a ring, by means of the regular representa- 
tion (Exercise 14.4): Identify R[z] as a subring of Hom(R[2], R[z]). 

4. Describe explicitly the field of fractions of F[z], where F is a field. 

5. Give an example in a Euclidean domain R where d(a) = d(b) but 
Ra # Rb. 

6. In a Euclidean domain, an element b is invertible iff d(b) = d(1). 


Formal Power Series 
Exercises 7 through 20 introduce formal power series, and illustrate 
some of their varied applications. 

7. The ring of formal power series R{[x]] is defined as the set of all 
formal (infinite) sums )>,<) i2", where each a € R, with addition 
defined componentwise, i.e., 5 aja + >> bja* = S3(a; + b;)2", and 


10. 


1. 


12. 


13. 
14. 


15. 


multiplication given via the formula 


where ¢, = Soi, dibu—i is calculated in R. Show that R[[s]] is 
indeed a ring, and R[s] is a subring of R[[z]]. Arguing by means of 
the lowest order term, show that R[[z]] is an integral domain if R 
is an integral domain. 

The point of working in R[[2]] is that often one has a concrete 
way of describing the inverse of a polynomial. 
(1-2) '=14+2+427+...,in R[[z]]; more generally if f € R[[z]] 
has constant term 0 then (1— f)"'=14+ f+ f?+.... (The point 
here is to explain why the right-hand side makes sense.) 
If F is a field then any power series g in F[[]] with nonzero constant 
term is invertible. (Hint: Write g = a(1—f). ) 
Noting that Ears) = ras Oe - aa) give an explicit formula 
(over C) for the power series corresponding to the inverse of a qua- 


dratic polynomial having nonzero constant term. 

As in exercise 2, any infinite sequence (a9,a1,...) corresponds to 
the formal power series f = a9 t+ a,2+.... In particular, take f 
corresponding to the Fibonacci series (1,1,2,3,5,8,...), and note 
f=ltafta2’f. Hence f = (1 — x —27)71; use Exercise 10 to 
produce a closed formula for the Fibonacci coefficients. 

Given a power series f = So jen a,x", define f(0) formally as ag, and 
define the formal derivative f’ = 50... ia;a'~1; define inductively 
fF = f and f) = f-0", Prove (f +9)! = fi +g! and (fg)! = 


f'g4+ fg’. Prove the formal version of Maclaurin’s expansion: 


1 n n 
ee i le 
nen 
If f(0) = 1, then (f")' =nf"' f", for any integer n. 
Prove the “binomial expansion” for formal power series, for any 
rational number n: 
(m-1) 4. n(n-l)in-2) 3 ; 


(1+ a2)" =14+nar+ “aa ae ae 


(Hint: apply Exercise 12 to f = 1+ az.) 

If f(0) = 0, define exp(f) =1+f+ £ + rare . Show exp(f)’ = 
exp(f)f’. Likewise if g(0) = 1, define log(g) = g - sg” + z9° ses 
show (log q)' = g'g_'. Also show log exp(f) = f and exp log(q) = g. 


16. 


17. 


18. 


19. 


20. 


The Partition Number 

In Exercises 16 through 20 we shall use formal power series, coupled 
with a clever combinatorial argument, to compute the partition 
number P,, (ef. Exercise 7.13). We shall write each partition of 
n in descending order, t.e.,n = ny +n2+---+n,z with ny > ny > 
eas 

If f:(0) = 1 for each 2, then the infinite product fife fs... makes 
sense as a formal power series, since one can determine each coeffi- 
cient as a finite sum. 

Tf (ia ee See eae es, then yi gate, yeas 
where formally we take Py = 1. 

Let us call a partition n = ny +no+---+ ny, strictly descending 
if ny > ng > ++: > ny. Let Q, denote the number of strictly 
descending partitions of n. The partition n = ny +nog+---+ nz 
is called even (resp. odd) if k is even (resp. odd). Define Qt 
(resp. Q;,) to be the number of even (resp. odd) strictly descending 
partitions of n. For example, P, = 5, and Qt =Q, =1. Show 


[oo] 


(sale) tor) =) (0, = 0,32" 


n=0 


=l-r—-rterte —-2? 27h +... . 


Conclude that (S75 Paw” )(9>~ (Qi -— Q;,)2”) = 1. Deduce the 
formula: P= 95) (Op =O Pas 

Prove that Q*+, = Q>, unless m has the form aii in which case 
Qi, -— Q,, = (-1). Calculate Q;, — QF, for all values of m < 20. 


(Hint: Given any strictly descending partition m, > m2 > +--+: > mz 


of m, take 7 maximal such that m; =m — (7 — 1), #-e., such that 
my = m+1=me+2 =--- = mj +(y—-1). Let i = my. If 
i<j, then one can produce a partition of opposite parity, namely 
my tl >m,+1>--->mzp+1> mii > +++ > Mp1; similarly if 
a= yj and k > 7. On the other hand, if 7 < i—1, one can reverse the 
procedure and get the partition mj -—1 > m2.—-1>--->mj-1-1> 
mi > +--+ > m, > 4-1; similarly if 7 =i-—1 andk > 7. Thus all 
strictly descending partitions pair off in opposing parities, except in 
the case 7 =i—1 or 7 =72, where also k = 7. In each of these two 
cases, the unpaired partition must bei+j7—-—1>71+j7-2>--->42, 
som =ig+ ey substitute 7 = 7+1 andi=j.) 

Using exercises 18 and 19, conclude Euler’s formula: 


Pr = Pn i+P, 2—P, eee ao 7+ Pr wth, 1 SEB: : 


21. 


(Precisely, P, = >> (—1)7+!P,_m.) 


m=(372 +3) /2 
Unique factorization domains 

The following two conditions are equivalent for an integral domain 
Rtobe a UFD: 

(i) Every irreducible is prime; 

(ii) There is no infinite sequence 71,72,...,in R such that rj41 
is a proper divisor of 7; for each 2. (Hint: (=) Write f(r) for the 
number of primes in the factorization of r. Clearly, if r = ab, then 
L(r) = La) + £(b); by hypothesis £(71) > f(r.) > ..., so any such 
sequence has < f(r1) terms. (<=) Mimic the proof of Proposition 16. 
Compare with Theorem 17.18). 


CHAPTER 17. PRINCIPAL IDEAL DOMAINS: 
INDUCTION WITHOUT NUMBERS 


Although the applications given in the previous section are quite beautiful, 
the basic theorems on unique factorization might seem somewhat ad hoc, 
based on the fortuitous appearance of the Euclidean algorithm and the 
degree function, and are rather cumbersome to state. Actually one can 
rework the arithmetic theory more intrinsically in terms of ideals, so that 
many of the main results become easier to state, and in greater generality. 


Definition 1. A PID (principal ideal domain) is an integral domain R in 
which every ideal is principal, t.e., has the form Ra. 


Example 2. Every Euclidean domain is a PID, by Corollary 16.14. 


Actually, one can broaden Corollary 16.14 to get PID’s which are not 
Euclidean, cf. Exercises 3,4, and 5. 

The point of the PID is that it is defined concisely in terms of ideals 
and permits us to utilize ideal theory in its study. To this end, we shall 
compile a dictionary for integral domains that “translates” the relevant 
notions of number theory from elements to ideals. In an integral domain R, 
the element a corresponds to the ideal Ra, so it is appropriate to note at 
the outset that RaRb = Rab (verification left to reader), which implies 
that the set of principal ideals of R inherits a monoid structure from the 
multiplication in R, cf. Exercise 1. 


Elements Ideals Comments 
alb RbC Ra be Ra iff Rb C Ra 
a associate to b Ra= Rb Immediate 
p irreducible Rp Cc RacR Remark 16.21 
is impossible. 
p prime If RaRb C Rp then Remark 16.21 
RaC Rp or ROC Rp. 
c divides a and b Ra+RbC Re Remarks 13.10, 14.4” 
a and b both dividec Ran RbD Re Remarks 13.10, 14.4” 


Remark 2'. Let us continue this analysis. We say c is a common divisor of 
a and b if cla and c|b. The greatest common divisor gcd(a,b) of a and 6b is 
defined to be that common divisor d (of a and b) for which every common 
divisor c of a and b divides d. If gcd(a, b) exists then it must be unique up 
to associate. 
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Clearly, c is a common divisor of a and b iff Ra C Re and Rb C Re, 
or equivalently Ra+ Rb C Rc. It follows that if Ra + Rb = Rd, then 
d = gecd(a,b). Consequently, d = gcd(a,b) exists in any PID. Furthermore, 
we have the useful consequence that there are r,s in R for which d = ra+sb. 
(See Exercise 2 for a refinement of this observation.) 


One could continue this reasoning and describe the lem (cf. Exercise 
12). However, we satisfy ourselves here by noting that two elements a and 
b are relatively prime if Ra+ Rb = R; the converse is true when R is a PID. 
One method of checking the relative primeness of polynomials is given in 
Exercise 13. 

Although “irreducible” and “prime” elements have been described in 
terms of principal ideals in arbitrary rings, they are especially significant 
in PIDs. In a PID, the criterion for p to be irreducible states precisely that 
Rp is a maximal ideal (cf., however, Examples 10 and 11.) To obtain a 
similar neat condition for p to be prime, we need to introduce a new kind 


of ideal. 
Prime Ideals 


Definition 3. A (proper) ideal P of an arbitrary ring R is called prime if 
the following condition holds for arbitrary ideals A,B of R: 


If ABCP then ACP or BCP. 


“Prime ideal” is one of the basic concepts of ring theory, involving the 
structure theory in several ways. 


LEMMA 4. The following conditions are equivalent for a commutative 
ring R: 
(i) R is an integral domain; 


(ii) The element 0 is a prime element of R; 
(iii) The ideal 0 is a prime ideal of R; 
( 


iv) The product of any two nonzero principal ideals of R is nonzero. 


Proof. (Note that an ideal A C 0 iff A= 0.) 
(7) & (2) By Remark 16.21(ii). 
(22) => (i221) Suppose 0 #4 A, B«R with AB = 0. Taking 04a € A and 
046 € B,wesee RaRb C AB = 0. Hence Ra = 0 or Rb = 0, contradiction. 
(777) => (2v) Obvious. 
(iv) => (22) By definition of prime element. 0 


LEMMA 5. Suppose I C P are ideals of R. Then P is a prime ideal of R, 
iff P/I is a prime ideal of R/T. 


Proof. We use Proposition 14.7 repeatedly. Write R = R/P and P = P/I. 
(=) Suppose A,B «R with AB C P. Then there are A,B 4 R with 
A/I, B = B/I. Clearly, AB C P so AC P or BC P; i.e, AC P or 


(<) Suppose A, B« R with AB C P. Then (A/I)(B/TI) C P in R, so 
A/I CP or B/I C P. We conclude AC Por BC P.O 


PROPOSITION 6. An ideal P of a commutative ring R is prime iff R/P is 
an integral domain. 


Proof. Combine Lemmas 4 and 5. 0 
Thus an element p of a PID R is prime, iff Rp is a prime ideal. 


Remark 7. Recall (Corollary 14.8) that A is a maximal ideal of a commu- 
tative ring R iff R/A is a field. Since every field is an integral domain, we 
conclude that every maximal ideal is prime. 


PROPOSITION 8. A nontrivial ideal of a PID is prime iff it is maximal. 


Proof. (=) If Rp # 0 is a prime ideal then p ¥ 0 is a prime element and 
thus is irreducible, by Remark 16.18; hence Rp is a maximal ideal. 


(<) by Remark 7. 0 


COROLLARY 9. In a PID, a nonzero element is prime iff it is irreducible. 


Example 10. The polynomial ring R = Z[z]is not a PID. Indeed the substi- 
tution z +> 0 yields an onto homomorphism ~: R > Z. Then R/ kery = Z 
by Noether I, so ker 7} is a prime ideal of R that is not maximal (since Z is 
not a field). But f(x) € ker a iff f(0) = 0, iff z|f, so kerry = Ra 40. In 
view of Proposition 8 we conclude that R is not a PID. Note that here = is 
irreducible although Ra is not a maximal ideal. 


Example 11. Define the polynomial ring R = F[x1, x2] in two indetermi- 
nates over a field F to be (F[21])[z2]. R is not a PID. Indeed define the 
homomorphism 4): R > Fla] by 22 4 0; as in Example 10, ker = Rr, 
is a nonzero prime, nonmaximal ideal. 


Fact 12. Ina PID, “unique factorization” means that every nonzero ideal A 
can be written as a product P,...P; of maximal ideals, which is uniquely 
determined in the sense that if P,...P; = Q,...Q., for maximal ideals 
P,,..., Py, Q1,....Qy, then # =u and the Q;....,Q;: are a reassortment of 
the Py,.... Pr. 


Noetherian Rings 

We want to reprove Theorem 16.24, under the weaker hypothesis that R 
is a PID. Uniqueness of any factorization holds by Proposition 16.20 (since 
in any PID, irreducibles are prime). However, we need to prove the exis- 
tence of a factorization into irreducibles, thereby requiring a substitute for 
the induction argument used in the proof of Proposition 16.16. This is a 
tall order, since in general a PID has no number with which to build an 
induction argument; nevertheless there is is a way that bears the name of 
E. Noether, its discoverer. 

To motivate the argument ,let us review the proof of Proposition 16.16 
applied to some positive number, say 48. To prove 48 is factorizable into 
primes, we could first factor 48 into two proper divisors, say 48 = 3 - 16, 
and then note by induction that 3 and 16 are factorizable into primes. But 
we did not use the full induction hypothesis — indeed we only needed to 
apply induction to divisors of 48. Concerning an element a in general, our 
hypothesis need apply only to proper divisors of a, or, translating to ideals 
in a PID, we need consider only principal ideals properly containing Ra. 
Carrying this idea further, we want to be sure that a given process must 
terminate when we pass on to larger and larger principal ideals. This leads 
us to the following definition. 


Definition 18. A ring R satisfies the ascending chain condition on ideals 
(denoted ACC(ideals)) if any chain A; C Ay C A3 C... of ideals necessar- 
ily terminates, i.e., there is n such that A, = An +i =... . Rings satisfying 
ACC(ideals) are also called Noetherian. 


Example 14. Any Euclidean ring R is Noetherian; indeed if Ra; C Rag C 
Raz C... then deg a, > deg az > dega3 > ..., so this chain must termi- 
nate. More generally we have 


PROPOSITION 15. Any PID is Noetherian. 


Proof. Suppose Ay C Az C... are ideals of R. Then Remark 14.4” implies 
Ui>1A;4R and so has the form Ra, by hypothesis. Hence a € A, for some n, 
implying AC A, C Anyi C--- C A, yielding A, = Any, =... .O 


ACC (ideals) implies the following formally stronger property: 


PROPOSITION 16. R is Noetherian iff every set S of ideals of R has a 
maximal member (i.e., some ideal of S is maximal among those in S ). 


Proof. (=) Take any ideal A; in S. If A, is maximal, then we are done; 
otherwise S has some Ay D A,. Continuing in this way we take 


A, C Ap C AZ C... 


and by definition are forced to stop at some A,; thus A, is maximal in S. 
(<=) Given A; C Ay C Ag C... let S={Aj, Az, Az,...$; some A, is 


maximal in S, so A, = Anyi =....0 


CoROLLaRY 17. (But see Aside 20.) Any ideal of a Noetherian ring is 
contained in a maximal (proper) ideal. 


Proof. In Proposition 16, take S to be the set of proper ideals. 0 


ACC provides a suitable substitute for mathematical induction. One 
technique of verifying a given property P on a class of ideals of a Noetherian 
ring R, is to assume on the contrary that P does not hold for all such ideals; 
then take an ideal maximal with respect to not satisfying P, and arrive at a 
contradiction. This method is called Noetherian Induction. Here is a good 
illustration. 


THEOREM 18. Every PID is a UFD. 


Proof. As observed above, we only need to prove that a factorization into 
irreducibles exists. If not, take Rag maximal of all principal ideals Ra for 
which a fails to have a factorization into irreducibles. Clearly, ao itself is 
not irreducible, so Rag is properly contained in a maximal ideal P = Rp. 
Write ag = bp. Then Rag C Rb, so by hypothesis there is a factorization 
b= p,...pp into irreducibles; then a9 = p,...pzp, contradiction. 0 


ACC (ideals) turns out to be the key hypothesis in the structure theory 
of commutative rings, in view of Exercises 19, 20, and 21; for example, for 
any primitive root p of 1, the important ring Z[p]is Noetherian, even though 
Z[p] need not be a UFD. Easier examples of this phenomenon are given 
in Exercise 6. In general, ideals have replaced elements in the structure 
theory. Prime ideals have turned out to be the “correct” generalization of 
prime elements, even for non-UFDs. 

It is not so easy to construct an example of a ring that is not Noether- 
ian. One approach is given in Exercise 24; however, I prefer the following 
example, since it illustrates how to transfer “bad” properties from groups 
to rings. 


Example 19. Let R= {f(x) € Qz] : f(0) € Z}. Thus f € Riff f =m+zg9, 
where m € Z and g € Q[z]. Clearly, R is a ring, and for any subgroup G 
of (Q,+) we have the corresponding ideal Gr + 2?Q[z] of R. Now let G; = 
{t:m € Z}, a subgroup of (Q,+), and let A; denote the corresponding 
ideal of R. The infinite ascending chain of subgroups 


Go C G4 CGC... 


translates to an ascending chain of ideals Ap C Ay C Ag C... in R. 


We review different kinds of rings in increasing generality from left to 
right: 


Noetherian 


x 


Euclidean domain > PID UFD — Integral domain 


Aside 20. Ironically, the conclusion of Corollary 17 holds without any hy- 
pothesis, by “Zorn’s lemma,” one of the pillars of ring theory: 


Zorn’s Lemma. Suppose A is any set and S is a collection of subsets of A, 
such that whenever A; C Ay C ... with each A; in S we have UA; € S. 
Then S contains some maximal member. 


The set of proper ideals of an arbitrary ring R satisfies the condition of 
Zorn’s lemma, since if Ay C Ap C ... are proper ideals of R, then, clearly, 
1 ¢ UA;, implying UA; is a proper ideal. Zorn’s lemma thus shows that 
R has maximal (proper) ideals. This application of Zorn’s lemma requires 
the existence of a unit element of R. 

The proof of Zorn’s lemma relies on the “axiom of choice” from set 
theory, but this axiom is normally accepted by algebraists (because they 
do not want to give up Zorn’s lemma!). 


Exercises 


1. In the “dictionary” add the following entry: the monoid of nonzero 
principal ideals of an integral domain R (with R as the neutral ele- 
ment) corresponds to the monoid of equivalence classes of nonzero 
elements of R. (two elements being equivalent when they are asso- 
ciates). Show in the cases R = Z and R = F{2] that this monoid 
can be identified with a suitable submonoid of R, namely Nt and 
{the monic polynomials} respectively. For this reason one some- 
times deals more generally with unique factorization monoids in- 
stead of unique factorization domains. Fact 12 says that the monoid 
of nonzero ideals in a PID is a unique factorization monoid. 

2. If d = ecd(a,b) in a PID, then one can find r,s with r relatively 
prime to a, such that d=ra-+ sb. (Hint: Write a = ad and b= bd. 
Then 1 = ra4+ sb. Let q be the product of those primes dividing 
a that do not divide r, and let r! = r+ bq, s' = s — dg. Then 
ecd(r’,a) = 1 since r',a have no common prime divisor; on the 
other hand, r'a + s'b = d.) 


Counterexamples 

3. Call an integral domain R quasi-Euchidean if R has a function 
d: R — N satisfying the following property: If d(a#) > d(y), then 
either y|x or there are z,w € R for which d(az— yw) < d(y). Mod- 
ifying the proof of Theorem 16.11, show that any quasi-Euclidean 
domain is a PID. 

4. Given any ring R and S C R, define $= {s€S:a+Rs CS for 
some a in S}. Define Ro = R \ {0} and inductively Rj, = R; for 
each 2 > 1. Note that Ry = {noninvertible elements of Ro}. Show 
that if R is Euclidean, then (),- Ri = . (Hint: By induction, 
d(s) > n for all s in Ry.) 

5. Let R = {a+bG8 :a,be z\ CC. Show R is quasi-Euclidean 
(and thus a PID) but not Euclidean. (Hint: Given x,y relatively 
prime, we want to find z and w for which d(xz — yw) < 1. Write 
-= athe 9 where c > 1 and (a,b,c) =1. For c > 5 take numbers 
d.e, f,q,7, for which ae + bd+ef =1, ad—19be =eq+r, |r| < $. 
Put z = d+e/—19 and w = q— fyY—19. If c = 2, then a is 
odd [or else y|z], so take w = at + by 18 z= 1. Force = 3,4 
take z = a — b\/—19, and work out w. Hence R is quasi-Euclidean. 
To show R is not Euclidean it suffices to show R; = Ry» in the 
notation of Exercise 4. But Ry = {R \ {0,41}; if s € Ri \ Re, 
then for each a in R, there is 6 in R for which a+bs ¢ Ri, ie., 
—bs € fa—1, a, a+1}. Taking a = 2, show that —bs € {1,2,3} 
so s = +2 or +3. Now get a contradiction by taking a = ibys ) 

6. Z[V10] is not a UFD. (Hint: 9 is a product of two primes in two 
inequivalent ways). Similarly, Z[/—6] is not a UFD. Using the fact 
that Z[z] is a UFD, to be shown in Chapter 20, conclude that an 
integral domain that is the homomorphic image of a UFD need not 
itself be a UFD. 

7. Using the idea of Example 10, show that, for any n in Z, the ideal 
of Z[z] generated by # — n is prime but not maximal. (You might 
want to appeal to lemma 18.2.) 

8. Explicit illustrations of Examples 10 and 11: R = Z[z] has the 
nonprincipal ideal Ra + 2k; R = Fla2,,22] has the nonprincipal 
ideal Rx, + Reo. 


Consequences of Zorn’s Lemma 

9. State the analog of Zorn’s lemma, using descending chains instead 
of ascending chains. Nevertheless, many rings (such as Z) fail to 
have minimal nonzero ideals. What goes wrong? 


10. 


11. 


12: 


13. 


Suppose S$ is any submonoid of R\{0}. Then R has a prime ideal P 
that is disjoint from S. (Hint: By Zorn’s lemma there is an ideal 
of R that is maximal with respect to being disjoint from S'; prove 
it is a prime ideal.) 

An element a of a ring R is called nilpotent if a” = 0 for suitable 
n > 0. An integral domain cannot contain any nonzero nilpotent 
elements. In an arbitrary commutative ring R, prove that the in- 
tersection of all the prime ideals of R consists precisely of the nilpo- 
tent elements of R. (Hint: If a € R is nilpotent, then a is contained 
in every prime ideal P since its image in R/P is 0; if a is not 
nilpotent, then by Exercise 10 there is a prime ideal disjoint from 
{a":n€N}.) 

As in Remark 2', one can define a common multiple of a and b; 
we write c = Icm(a,b), called the least common multiple, if c is 
a common multiple which divides every other common multiple. 
Show c= Icm(a,6) if RaN Rb = Re. Such c exists in a PID and is 
unique up to associate. 

(“The resultant polynomial”) Two polynomials f(x) = 77, aix" 
and g(#) = Vy b;x over a field F are relatively prime iff the 
following matrix is nonsingular: 


Gm Am—1 «a. ao 0 0 0 
0 Gm Gm—1 +. Go 0 0 
0 0 a 
A= : 
bn bnoa bo 0 0 
0 bn . by bo 0 
0 0 bo 
2? t™—! gg 0 
n+m—2 10 0 
, 2?t™-3q1 .. 0 
(Hint: Let B= : ; . Then 
x 00... 10 
1 oo... 1 


[Alo"*""! = |AB| = f(x) h(x) + g(a)k(2), 


where degh < n-1, degk < m—1. If |A] 4 0 and f,g have a 
nonconstant factor r(az), then x divides r, so ag = 0 = bo, implying 
|A| = 0 after all. Conversely, if |A| = 0, then fh = —gk;: since 
deg f > deg k, conclude that some irreducible factor of f divides g.) 


14. 


15. 


16. 


17. 


18. 


19: 


20. 
PA 


22. 


23. 
24, 


UFDs 

(see also Exercise 16.21 and Exercises 20.3 through 20.7) 

Define the ACC on principal ideals in anology to Definition 13. 
The ACC on principal ideals implies factorization (not necessarily 
unique) into irreducible elements. 

Show that R is a UFD, iff R satisfies the ACC on principal ideals 
and every irreducible element is prime. (Hint: See Exercise 16.21) 
If R is a UFD, then the intersection of any two principal ideals is 
principal; furthermore, any two elements of R have both a gcd and 
an lem, and gcd(a, b)lcm(a,b) = ab. 

Ris a UFD, iff every nonzero prime ideal of R contains a prime 
element. (Hint: Let S = {finite products of prime elements}. If 
r ¢ § then Exercise 10 implies that r is contained in a prime ideal 
P of R disjoint from S$, contrary to the condition.) 

In a UFD every minimal nonzero prime ideal (i.e., does not contain 
any other nonzero prime ideal) is principal. 


Noetherian Rings 

We say an ideal L of a commutative ring R is finitely generated 
iff F = Ra, +---+ Ra; for suitable elements a1,...,a,; of R (and 
suitable t). Prove R is Noetherian iff every ideal of R is finitely 
generated. (Hint: (=>) by the proof of Proposition 15.) 

The homomorphic image of any Noetherian ring is Noetherian. 
“Hilbert Basis Theorem” If R is a Noetherian ring then the poly- 
nomial ring R[x] is Noetherian. (Hint: Given EL « R[a], define 
[Ty = 0Ufa € R: ais the leading coefficient of some polyno- 
mial of degree < k in E}<R, for each k > 0; write Ly = ee Ray, 
for suitable aj, in R, and take fy, € L of degree < k having lead- 
ing coefficient a,;,. Note that L, C Ly C.... Taking n such that 
Le Dei = us CONCMdE Le = ye Rfiz-) 

In a Noetherian ring, any ideal contains a product of prime ideals. 
(Hint: From the proof of Theorem 18.) 

The rings of Exercise 6 are Noetherian, although they are not UFDs. 
Generalizing Example 11, define inductively the polynomial ring 
Play,2,...,@4], as Play, 22,...,%4—-1][2+], and show it is a Nocther- 
ian integral domain. Viewing F[21,22,...,21+-1] C Flai,22,..., 24] 
in the natural way, define R = UP, Flx1,22,..., 2+]. Note that 


Rar, C Rr,+ Rr C Rr, + Rr.4+ Rez Cpe 


is an infinite ascending chain of ideals in R, so R is not Noetherian; 
however, Ris a UFD, cf. Exercise 20.7. 


CHAPTER 18. ROOTS OF POLYNOMIALS 


This chapter (and much of our subsequent material) grows out of the fol- 
lowing innocuous application of the Euclidean algorithm for polynomials: 


Remark 1. (Notation as in Proposition 16.7.) When R is commutative and 
g = xz-—cwithc € R, we can calculate r directly, by means of “substituting” 
c for x; since r is constant and g(c) = 0, 


r= fle) —alejgle) = fle) (1) 


Although the reader might be willing to accept this Remark without 
further ado, let us be careful and justify the substitution we have just made, 
by proving 


LEMMA 2. “Substitution Lemma.” Suppose R is commutative. For any 
given c € R, there is a “substitution” homomorphism R[x] > R defined by 
So ajr' 4 So a;c'. (In other words we “substitute” c for x.) 


Actually the following more general result will also be useful (One 
recovers Lemma 2 by taking C = R and 7 = 1p). 


LEMMA 3. Suppose C is a commutative ring, and c € C. Any given 
ring homomorphism 1: R > C extends uniquely to a ring homomorphism 


yp: R[a] > C, such that p(x) = ¢ and y(a) = (a) for all a in R. 
Proof. If such y exists, we must have 


eo ai") = VF playa) = So dade’, (2) 


proving uniqueness. It remains to show that y as defined by (2) is a homo- 
morphism. 


(So aia + bia’) = pl (as + de" =De (a; + b:)¢ 
epee, 
= (>> a;x") J+ “a 
eae) Oe) = oboe ajby jx" 


u i=0 
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(Here the hypothesis C commutative was needed to move c past #(b;).) 
Finally, g(1) =y~(1)=1. O 


The careful reader may have spotted an application of a special case 
of Lemma 2 in Example 17.10. One can weaken the hypotheses a bit more, 
cf. Exercise 1. 


COROLLARY 4. Suppose R is commutative, andc € R. For any polynomial 
f(x) in R[x], f(c) = 0 iff x — e divides f in R[x]. 

Proof. Write f = qg+r, where g = x—c. Remark 1 yields r = f(c), which 
is Oiff (2—c)|f.0 


THEOREM 5. (Easy part of the Fundamental Theorem of Algebra.) Sup- 
pose R is an integral domain, and a,,...,a4 are distinct roots of f. Then 
t < deg f, and f = (2 — a)...(2 — a,)h for suitable h € R[x], where 
deg h = deg f — t. 


Proof. Induction on t, the case t = 1 being Corollary 4. In general write 
f = (t#-—a1)g. Note degg = deg f —1. But az,...,a4 are roots of g; 
indeed, for each 7 > 1 we see 0 = f(a;) = (a; — 1) 9(a;) implying g(a;) = 0. 
By induction t — 1 < deg g and g = (% — ag)... (a — a4)h, where deg h = 
deg g—(t—1) = degf —t, implying f = (a-a1)g = (w-a1)...(2-ay)h. 0 


CorRoLuary 6. If R is an integral domain and the polynomial f € R[x] of 
degree n has n distinct roots a1,...,@, in R, then 


f=c(@—a,)...(a@- an), 


where c is the leading coefficient of f. 


Proof. Write f = c(~—a,)...(z—a,) where c € R[x]; then deg c = n-—n = 0 
so c is constant; clearly, the leading term of f is cx”. O 


There is a very nice application to fields. 
Finite Subgroups of Fields 
THEOREM 7. Any finite multiplicative subgroup G of a field is cyclic. 


Proof. Let n = |G| and m = exp G. By remark 7.12! it suffices to prove 
n=m. Clearly, n >m. On the other hand, each of the n elements of G is 
a root of the polynomial z™” — 1, so n <m by Theorem 5. O 


This was so easy that we might wrongly be tempted to slight its impact. 
So let us quickly give a deep consequence. 


COROLLARY 8. Euler(p) is cyclic, for every prime number p. 
Proof. Apply Theorem 7 to the field Z,. 0 


However, I challenge the reader to find a constructive proof, i.e., to 
determine systematically a generator of (Z,\{0},-). Concerning the struc- 
ture of Euler(m) for m not prime, recall that Euler(8) % (Z2,+) x (Z2,+) 
is not cyclic; the general situation is described in Exercise 4. 


Primitive Roots of 1 

Consider an arbitrary finite subgroup G of C. By Theorem 7, G is cyclic, so 
we write G = (a). Then a” = 1 for n = |G, so a is a root of the polynomial 
z” — 1, one of the most important polynomials we shall encounter. Let 
us take a few moments to study its roots in C. The most straightforward 
approach is by means of the unit circle C = {a+iy: 27 +y2 = 1} = 
{cos@ + isin 6:6 € R} on the complex plane. 


Remark 9. We can understand C better by defining e’” = cos@ + isin @. 
Then e?7' = 1, and 


e914) — cos(O, +O) + i(sin @, + 2) 
= cos 6; cos Oy — sin 6; sin 6) + i(sin 6; cos 6 + cos ; sin A) 
= (cos 6, +7sin 6, )(cos 62 + isin 62) 
161 102 
sere, 


Consequently, Cis a multiplicative subgroup of C, and there is a nat- 
ural group homomorphism R > C' given by #6 e”*. 


Qrifjn LAnif/n Qnrif/n _ 
7 € .° = 


Now we see that the roots of 2” —1 are fe e 


1}, so z”—1 has precisely n roots in C. Certain of these roots play a special 


role. 


Definition 10. pis a primitive nth root of 1 if p” = 1 but p™ ¥ 1 for all 
l<m<n. 


If p is a primitive nth root of 1, then {p* :0< k <n} are distinct, so 
we have 


Remark 11. {nth roots of 1} is a cyclic group (under multiplication) which 
is isomorphic to (Z,,+) and is generated by any of the primitive nth roots 


Qhri/n 


of 1. Consequently e is a primitive nth root of 1 iff (k,n) = 1; the 


number of primitive nth roots of 1 is the Euler number y(n). 


Remark 12. If p #1 is an nth root of 1 then pea p! = —-1. (Indeed p is 
a root of 2” —1=(2" '+---4+1)(2—-1), so pis a root of r?~'+---+1.) 


These observations will elevate the Euler group to a key tool in field 
theory. However, let us first give a quick application to number theory. 


Remark 13. n = eos p(d), for any n. Indeed, any nth root of 1 is a 
primitive dth root of 1 for some d dividing n, so the equality is obtained 
merely by calculating the number of nth roots of 1 in this manner. 


Roots of 1 also are used in factoring other polynomials. 


PROPOSITION 14. If F is a field containing a primitive nth root p of 1 then 
x” —b" =(x—b)(x2— pb)...(x% — p®~"b) 


for any b in F. 


Proof. b, pb,...,p"~'b are all distinct, and (p*b)" = p*"b™ = (p”)*b™ = b” 
implies each p*b is a root of 2” — b”. Hence we are done by Theorem 5. 0 


COROLLARY 15. If F is a field containing a primitive nth root p of 1 then 
for all a,b in F we have 


a” — 6" = (a—b)(a— pb)...(a— p” db) 


Proof. Substitute a for x in the Proposition. 0 


Exercises 


1. (A noncommutative generalization of Lemma 3): Suppose T is 
any ring, and suppose c € T commutes with all elements of T. 
Any homomorphism 7: R > T extends to a unique homomorphism 
yp: R[z] > T, such that g(r) = w(r) for all r in R and g(x) =e. 


The Structure of Euler(n) 

2. If 8|n, then Euler(n) is not cyclic. (Hint: £1 and £(1+ 3) are four 
distinct elements whose squares are 1.) 

3. For any t > 1, the number of solutions of a? = 1 in Euler(p*‘) is 
at most p. (Hint: Prove by induction on t > 2 that if a? = 1 
(mod p*), then a = 1 (mod p*—!). Indeed, for t > 2, by induction 
a =1 (mod p'~?), so write a= 1+ kpt-? and take p powers.) 

4. Euler(n) is the direct product of groups of the form Euler(p*) for p 
prime, according to the factorization of n into prime powers. Sup- 
pose n is a prime power. If 8 does not divide n then Euler(n) is 
cyclic; if 8 divides n, then Euler(n) © Zz X Z ginjo- (Hint: The first 
assertion is Exercises 6.5 and 6.6. For n = p* and let G = Euler(n). 
Then |G] = y(n) = p'*(p — 1). 


CASEI. p #2. There is a natural surjection 45: G— Euler(p), 
given as in exercise 4.8, so G has an element a of order p—1 = y(p), 
by exercise 4.9. On the other hand, ker y has order p'~1, so G is an 
internal direct product of ker) and (a). It suffices to prove ker y is 
cyclic, which follows from Exercises 3 and 7.2(i). 

CASE II. p = 2. One may assume t > 3. G is not cyclic, by 
Exercise 2, so it suffices to find an element 6 of order 2*~? = 7. Show 
via induction on ¢ that [5] has order > 2'~? in Euler(2*); hence take 
b = [5].) 

5. How many roots does the polynomial z° — 1 have in each of the 
following fields: Z2,Z3,Z5? In general how many roots does x” — 1 
have in Z,? 

6. 2 — 1a" — 1 iff mln. (Hint: «™ — 1 divides 2” — 1, iff ©” -— 1 
divides 2” — 2” = 2™(2"—™ — 1).) 


7. For any a,b in a field containing a primitive nth root p of 1 show 


n-1 


S- aib"—1-3 = (a — pb)... (a — p10); 


j=0 
1+b4---+6°1 = (1— pb)... (1- p18); 
n=(1—p)...(1—p""). 


(Hint: Cancel « — 6 from each side of Corollary 15.) 
8. (R/Z,+) & C, where C denotes the unit circle. 


WEEK 19. APPLICATIONS: FAMOUS RESULTS 
FROM NUMBER THEORY 


We are now ready to tackle number theory in Euclidean rings other than Z. 
Our motivation is that polynomials such as 


2” —1=(2—-1)(2% 1427-7 4.---41) 


factor more completely over extension rings of Z and thereby may enable 
us to solve equations more readily by means of these larger rings. We 
shall illustrate this fact in two famous results related to the famous French 
mathematician Pierre Fermat. Actually Fermat was a jurist by profession 
and never published his proofs; nevertheless, we believe that he knew the 
proofs, since there is only one mistake he is known to have made in his 
mathematical career. 


A Theorem of Fermat 

The first result concerns the following question about natural numbers: 
Which prime numbers p can be written as a sum of two square integers, 
i.€., 0° -+b?? Since a sum of two squares is clearly positive, we shall assume 
throughout that p > 0. It helps to look at the primes under 50. 


p: 2 3 5 7 11 13 17 19 
Sum: 14+1 NO 441 #4\NO NO 9+4 164+1 NO 


p: 23 29 31 37 Al 43 47 
Sum: NO 2544 NO 3641 254+16 NO NO 


Looking at these examples, we can conjecture Fermat’s a result: A 
prime number p # 2 has the form a? + 6? for suitable a,b in N iff p = 1 
(mod 4). 


One direction can be proved rather easily. 


Remark 1. If a is odd, then a? = 1 (mod 4); if a is even then a? = 0 
(mod 4). (Indeed, if a = 2k +1 then a? = 4k? + 4k +1 =1 (mod 4).) 


Let us use these simple computations, assuming p = a? -+b?. Excluding 
the prime p = 2, we have p odd. Then, clearly, either a or 6 must be odd; 
we may assume a is odd, and then, clearly, 6 is even. Thus, 


p=e@+b? =1+0 (mod 4)=1 (mod 4). 


Hence there are no solutions for p = 3 (mod 4). 
The remainder of our discussion focuses on proving conversely that 
every prime p = 1 (mod 4) does have a solution p = a? +67. A very 
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short proof of this fact is given in Exercise 1, but that proof is not easy to 
reconstruct. A more intuitive proof is motivated by the observation 


a +b? = (a+ bi)(a — bi) = yy, 


where y = a+ bi, i= /—1, and ~ is complex conjugation in C. However, 
the ring C is too large for our purposes. Since we are only interested in 
integral solutions, we note that this factorization also holds in the subring 
Zi] = {m+ni:m,n € Z} of C, called the ring of Gaussian integers. Thus, 
any integer of the form a? + 6? is not prime in Z[#], thereby leading us 
towards the following assertion, where we write N(y) for yy: 


THEOREM 2. The following assertions are equivalent, for a prime p # 2 
in N: 

(i) p= a? +0? for suitable a,b in N; 

(ii) p=1 (mod 4); 

(iii) p = N(y) for some y in Z[:]; 

(iv) p is not prime in Zz]. 


Proof. We have just seen the implications (i) > (ii) and (i) > (iv); fur- 
thermore, (i) © (iii) is immediate, since N(a + bi) = a? + b?. We shall 
demonstrate (ii) => (iv) = (iii) to complete the proof. But first we need to 
study arithmetic in Z[z]. Note that N(yz) = yzyz = yyzzZ = N(y)N(z) for 
all y, z in Z[:]. 


THEOREM 3. Z|?] is a Euclidean domain, with degree function d(y) = N(y). 


Proof. We verify Definition 16.8. Property (1) is clear since N(r) > 1 
for every 0 4 r in Z[z]. It remains to show, given y,z in Z[7] \ {0}, that 
y=qze+r for q,r in Z[i] satisfying N(r) < N(z). 

Since C = R[#] is a field, we have 4 € C, i.¢., 4 = uy + gi for suitable 
uy,t2 in R. Thus, y = (uy +ugi)z. For 7 = 1,2, let q; be the integer closest 
to uj, and let vj = u; — gj, so |v;| < >. Then 


algae 
=e 1 


N(v1 + 2i) = vy +02 <5 r 


letting r = y — (q1 + qot)z = (v1 + v22)z, we see that 
N(r) = N(x, + v9t)N(z) < N(z), 


so we conclude by writing y = (qi + q@i)z+r.0 


We need a few more facts, in order to perform arithmetic in Z[?]. 


Remark 4. (i) If ylz in Z[i], then g|zZ. (Indeed, if z = yq, then Z = 9@.) 

(ii) If c,d € Z and eld in Z[z], then eld in Z (since writing d = c(a+bz) = 
ca + ebi for a,b € Z we match real parts to get d= ca). 

(iii) y € Zz] is invertible iff N(y) = 1. (This follows at once from the 
fact that (a+ 62)71 = Rita) 


Proof of (iv) => (iti) in Theorem 2. Take a non-invertible, proper divisor 
a+bi of pin Z[i]. Then N(a+b2) divides N(p) = p? in Z, but N(a+bi) #1 
and N(a+ bi) # p?, both by Corollary 16.13. Hence N(a+ bi) =p. 


Proof of (a1) => (iv) in Theorem 2. This is more number-theoretic. Re- 
calling that Fermat’s Little Theorem can be obtained as an immediate 
application of Lagrange’s Theorem to the group Euler(p), we are led to try 
to exploit the extra information that Euler(p) is cyclic (Corollary 18.8). Let 
c be a generator of Euler(p), and write p = 4m-+ 1 for suitable m. Then 
o(e) = |Euler(p)| =p — 1 = 4m, so c#™ = 1 (mod p) but c?” #1 (mod p). 
Taking u = c?™ we see u #1 but uw? = 1 in Z,. But the field Z, has only 
the two solutions +1 to the equation 7? — 1 = 0; hence « = —1 (mod p). 
Taking d=e™ yields d? = u = —1 (mod p), so 


p divides (d? +1) = (d+i)(d—-1). (1) 


Now suppose p were prime in Z[i]. Then p divides d+ or d— i, so 
Remark 4(i) implies p divides both d+7 and d —7, and so 


p\((d-+4) — (d—1)) = 2%, 


i.€., p|2, contradiction. Thus, p is not prime in Z[:], as desired. 0 


Addendum: “Fermat’s Last Theorem” 

The raison d’étre for much of the subject of “modern algebras” lies in 
a certain problem in number theory, which we shall discuss here. In a 
stimulating book, The Last Problem, (Gollantz: London, 1962), E.T. Bell 
argues that this question lies at the foundation of human knowledge and 
endeavor. Certainly it has inspired generations of mathematicians and has 
inspired many outstanding developments in algebra and algebraic geometry. 
The ancient Egyptians were aware that 3? + 4? = 5?, and it is not difficult 
to find an infinitude of solution of a? + 6? = c? for a,b,c positive integers, 
merely by taking u,v € N arbitrarily, and putting 


a=w—vw, b=2uv, c=H=u? +r? 


These solutions are called “Pythagorean triples,” after the famous 
Greek mathematician and philosopher Pythagoras, but may well have been 


known previously by the Babylonians some 3500 years ago, cf. Edwards’ 
book Fermat’s Last Theorem, Springer-Verlag, 1977. It is somewhat harder 
to show that all solutions with a,b,c relatively prime can be obtained in 
this way, but the proof is still “elementary” and was known by the great 
classical Greek mathematician Diophantus, cf. Exercise 3. 

The obvious continuation is, “Are there solutions of a? + 6° = ¢? for 
nonzero integers a,b,c?” , or more generally, “Are there integers n > 2, and 
a,b,c # 0 such that a” + 6" = c”?” This problem was brought to the 
attention of the mathematical world by Fermat, who jotted in the margin 
of a book (Bachet’s translation of Diophantus’ magnum opus, Arithmetic) 
that he had discovered a truly marvelous demonstration for proving there 
are no solutions. As usual, Fermat neglected to write down his proof, and 
since then the greatest mathematicians of the world have tried to rediscover 
his proof (or find new ones). Recently (June, 1993) a solution to Fermat’s 
last theorem has been announced and outlined by Wiles. His proof builds 
on some of the most difficult mathematics of recent years, and is expected 
to take some time to verify. 

Possibly by the time this book has appeared, Wiles’ proof will have 
been found to be complete. This might cause a let-down, because the great 
quest of mathematicians is complete. However, one should view it as a 
great tribute to the human spirit, that the final solution of Fermat’s Last 
Theorem came before World War III or any other final solutions. 

Let us see what is available via the methods of this course. Even the 
case n = 3 is no pushover. Attempting to generalize the approach used for 
n= 2 in Exercise 3, let us write 


BP=e-a@=(ce-a\(e+cata’). 


Unfortunately, the right-hand side is not yet factored sufficiently to carry 
out the argument. However, working instead in any ring that contains a 
primitive cube root p of 1, we can write 


b? = (e—a)(c— pa)(e— p’a). 
This implies the three right-hand factors are “almost” cubes, but instead of 
reaching a solution analogous to Pythagorean triples, one instead concludes 
by induction that no solution exists! Before proceeding with the details, let 
us develop some properties of p. We define Z[p] = {a+ bp: a,b € Z}. 
Remark 6. (i) p? = —p—1 by Remark 18.12. implying Z[p] is a ring. 
(Indeed, 


(a + bp)(e+ dp) = ac + bdp* + (ad + be)p = (ac — bd) + (ad + be — bd)p. 


proving Z[p] is closed under multiplication.) 

(ii) Since p? is the other primitive cube root of 1 we see by the quadratic 
formula (applied to (i)) that p? = p. In particular, letting N(y) = yy we 
have N(a+ bp) = (a+ bp)(a+ bp?) =a? +b? —ab EN. 

(iii) Z[p] is a Euclidean domain, again with degree function N( ) (cf. 
Exercise 4), and thus a UFD. 

(iv) If ecd(a,b) =1 in Z then gcd(a+bp,a+t bp”) divides 1— p in Z[p]. 
(Indeed, let d = gcd(at bp,a+bp?). Then d divides (a+bp)p—(a+bp?) = 
a(p —1), and d also divides (a + bp) — (a + bp?) = b(p — p?) = bp(1 — p). 
and thus d divides gcd(a(1 — p),6p(1 — p)) =1-p.) 

(v) If z € Z[p] and N(z) = 1, then z is invertible; consequently if N(z) 
is prime in Z then z is prime in Z[p]. (For if z = yy’ then N(z) = N(y)N(y') 
in N, so N(y) or N(y'‘) is 1, implying y or y’ is invertible.) 

(vi) Let ¢g=1-—p. N(q) = 3, so qis prime by (v). Note p=1 (mod q), 
soa+bp=a+tb (mod q). 

(vii) Every element of Z[p] is congruent to one of {0,1,2} (mod q). 
(Indeed, take z € Z[p]. z is congruent to some integer m by (vi), which in 
turn is congruent to one of {0,1,2} (mod q), since q|3.) 


THEOREM 7. (Fermat’s Last Theorem for n = 3.) There are no integers 
a,b,c #0 such that a2 + BF = c3. 


Proof. We shall rely heavily on the prime q = 1 — p of Remark 6(vi) and 
the calculation (for k < 7 ) 
(y — p?z) — (y— p*z) = p*z — pz = ph2(1 — p-*) (2) 
=p (l= pips vet pe PN, 
In particular, y — p?z = y — p*z (mod q). 
Claim 1. If y= z (mod q) and q{ z, then y? = z? (mod q*). 


Proof of Claim 1. y? — 2 = (y — z)(y — pz)(y — p?z) by Corollary 18.15. 
By hypothesis q|(y — z) so (2) implies q|(y — p?z) for each j . Writing 


y — plz = qw;, we see that wj41 — w; = p?z = z (mod q), so in particular, 
w1,W2,W3 are all distinct modulo g. Hence some w; = 0 (mod q), by 
Remark 6(vii), so q-q-q = q' divides (y — z)(y — pz)(y — p?z) =? — 2°, 
as desired. 


Let us return to the main proof. Replacing c by —c we may rewrite 
the equation as 


a+bh+e=0. (3) 


We shall prove there is no solution for a,b,c #4 0 in Z[p]. Otherwise take 
such an example with N(c) minimal. Then a,b,c are pairwise relatively 
prime; indeed, if, for example, some prime p divided a,b then p divides 
ce? = —a® — b, and thus ple, so we could divide a,b,¢ each by p and have a 


smaller solution to (3). 
Claim 2. a,b, or ¢ is divisible by q. 


Proof of Claim 2. Otherwise, by Remark 6(vii) each of a,b,c is congruent to 
+1 (mod gq), so Claim 1 implies each of a3, 63, ¢? is congruent to +13 = +1 
(mod q*). But then 0 = a? + b? +c? is congruent to +1 or £3( mod q*), 
which is impossible since g* = ((1 — p)*)? = (—3p)? = 9p? is an associate 


of 9. 
We may thus assume gla, and conclude by proving more generally 


Claim 3. It is impossible that uq?a? + 6? + c? = 0, for u invertible in 


Z[p| and a,b,c in Z[p] prime to p. 


The proof is straightforward but involves lengthy computations, so we 
sketch it in Exercise 5. Thus Fermat’s Last Theorem is proved for n = 3. 

To prove Fermat’s Last Theorem in general, it would suffice to assume 
n= 4 or n is an odd prime (Exercise 7). The proof of n = 4 is easier than 
the above proof (cf. Exercise 6), so we are left with the case where n is 
prime. What is special about 3 in the above proof? The reader who traces 
through the proof will spot only two places where 3 cannot be replaced at 
once by an arbitrary odd prime number n: 

(i) Z[p] needs to be a UFD, for p a primitive nth root of 1; and 

(ii) The argument concerning invertible elements at the end of Exer- 
cise 5 is quite special, cf. Exercise 8. 

The program of proving Fermat’s Last Theorem along these lines was 
known by Lagrange and others, and who knows, possibly even by Fermat 
himself. Unfortunately, it turns out that Z[p] is not a UFD when n = 37. 
(However, it is Noetherian, of course.) Efforts to overcome this and other 
related questions in number theory led Kummer in the 1840s to a very 
careful investigation of Z[p], called the ring of cyclotomic integers, thereby 
laying the foundations for the subject of algebraic number theory. 

The twentieth century ushered in the geometric approach to Fermat’s 
Last Theorem. Clearly a" +b" = c" iff (4)"+(4)" = 1, so that the rational 
points of the graph z”+y” = 1 correspond precisely to the integral solutions 
of a” + 6" =c”. Thus, Fermat’s Last Theorem is reduced to showing that 
certain algebraic curves have no nontrivial rational points. This has inspired 
rapid strides in algebraic geometry in the past fifteen years, and the ensuing 


powerful techniques are so much more advanced than the methods available 
even 30 years ago, that what Bell regarded in 1961 as “The Last Problem” 
of civilization is generally regarded as solved. Recent work is far beyond 
the scope of this course. 

A mathematical history of some of these developments is given in Ed- 
wards’s book cited above. P. Ribenboim has written a stimulating set of 
lectures on Fermat’s Last Theorem (Springer-Verlag, Berlin, 1979), and his 
Chapter 4 contains some striking results that have been obtained through 
elementary means, including an “elementary” proof for even n, discovered 
in this century. So one wonders, after all, might Fermat have had a proof? 


Exercises 
1. A quick proof of Theorem 2 ((iz) > (i)): Let S = {(a,b,c) EN: 
a? + 4bc = p}. Define a map 0:5 > S by 


(a+ 2c, c, b-a-c) ifa<b-e 
(abe) 4 (2b-a, b, a—b+ec) ifb-c<aK< 2b 
(a— 2b, a—b+c, 6) ifa> 20. 


Then a? = 1, but o has exactly one fixed point, 7.¢., (a,b,c) = 
(a, b,c) (since then one sees b = a and thus a(a+ 4c) = p, implying 
a=1andc= 2+.) Hence, |S| is odd. But then the map given by 
(a,b,c)  (a,c,6) also has a fixed point, which is the solution to 
Fermat’s theorem. 

2. The primes of Z[#] are precisely those numbers of the following form: 
Primes of N congruent to 3. mod 4, and a+bi : a?+0? is prime in Z. 
(Hint: If p = 3 (mod 4) and p= yz in Z[:], then N(p) = N(y)N(z) 
so N(y) = p, impossible.) Show how the prime factorization of 
N(y) in N leads to the prime factorization of y in Z[z]. 

3. Prove that every solution in integers of a? + 6? = c? is given by 
Pythagorean triples. (Hint: It is enough to show this for a,b,c 
pairwise relatively prime. In particular, a or b is odd. Assume a is 
odd. Then 6 is even by Remark 1. Hence c is odd, and 


P= -—a@ =(e+a)(ce—a). 


Write c+a= 2y andc-—a=2z. Thene=yt+2zanda=y-z; 
hence y,z are relatively prime. But yz = (2) implies y = u? and 
z =v? where w = 4. Hence, c= u? + v? and a = wu? — v?.) 

4. If p is a primitive cube root of 1 then Z[p] satisfies the Euclidean 
algorithm, with the degree function N( ). (The proof is analogous 


to Theorem 3). What goes wrong with this proof for other primitive 
roots of 1? 

. Prove Claim 3, by induction on m. First note that m > 2, using 
Claim 1. But 


—ugma? = b° +e? = (b+ e)(b + pe)(b + p’e), 


so q? divides at least one of the factors; assume q?|(b + ¢). Then, 
as in the proof of Claim 1, q { (b+ pic) for 7 = 1,2. implying 
g"(b+0), ie. 


b+e=mqg "f?, b+pc= mag, b+p?c= usqh* 
for suitable f,g,h relatively prime in Z[p] and wu; invertible. But 


0 = (b+e)+ p(b+ pe) + p?'(b+ pc) 


3m—2 f? 


= m9 + purgg? + prusgqh?. 


Dividing through by puzq yields 
C= ger 4g +ulh 

with u',u'’ are invertible. Hence g? = —u!'h? (mod gq). implying 
g = +h (mod gq) and thus g? = +h? (mod q*), so vu’ = +1 
(mod q*). If one can prove uw!’ = +1; then replacing h by u!’h would 
yield 0 = u'¢g?~ fF +4? +h, contrary to the induction hypothesis. 

Thus, it remains only to show if uv” is invertible and wu!’ = +1 
(mod q*), then wu” = +1. Write u” = m-+np for suitable m,n in Z. 
Then 1 = N(u!’) = m? +n? — mn; thus, 4 = 4m? + 4n? — 4mn = 
(2m—n)?+3n?, implying |n| < 1; the possibilities for wu!’ are £1, +p, 
and +(1+ )). (Note 1+ p= —p’.) Of these clearly £1 are the only 
ones congruent to +1 (mod q*), as desired. 
. Prove Fermat’s Last Theorem for n = 4, by reducing to Pythagorean 
triples. This method enables one to prove more generally that there 
is no solution to at + b4 = c?. 
_ Ifn=st anda" +b" =c” then (a*)* + (b*)' = (c*)*; conclude from 
Exercise 6 that to prove Fermat’s Last Theorem it suffices to prove 
the case for n an odd prime. 
. If pis a primitive fifth root of 1, then Z[p] has an infinite number 
of invertible elements. 
. For any primitive nth root p of 1, define Z[p] = {WP aip* : 
a; € Z}. Show that Z[p] is closed under multiplication and thus 
is a subring of C. There is an onto ring homomorphism Z[2] > Z[p| 
given by z+ p; what is the kernel? Conclude that Z[p] is Nocther- 
ian, even though it is known that Z[p] need not be a UFD. 


CHAPTER 20. IRREDUCIBLE POLYNOMIALS 


Having seen that irreducible polynomials play a special role in the polyno- 
mial ring Fz], we should like to be able to determine which polynomials are 
irreducible. Later on we shall see that every irreducible polynomial gives 
rise to a field, thereby enhancing our interest in irreducible polynomials. 

It is convenient to say a polynomial p is “over” a ring R when its 
coefficients all lie in R, i.e., when p € R[x]. Clearly, every polynomial of 
degree 1 over an arbitrary field F is irreducible; such polynomials have the 
form az+b for a,b € F, and are called linear. In case F = C, all irreducible 
polynomials are linear, by the celebrated Fundamental Theorem of Algebra 
(Theorem 25.30). However, the situation is completely different for F = Q, 
where it is virtually hopeless to try to classify the irreducible polynomials. 
Instead we shall give several criteria for irreducibility, and from them we 
shall derive several interesting classes of irreducible polynomials over Q, 
including those that arise from roots of 1. 

First note that, dividing through by the leading coefficient, we may 
assume that our polynomial f is monic. Also recall Corollary 18.4: f has a 
factor z —a iff ais a root of f. This is enough for us to check irreducibility 
of any polynomial f of degree < 3; if f = gh with g,h nonconstant, then 
g or his linear. We shall also consider larger degrees. 

To determine that a polynomial f over Z is irreducible over Q, it often 
turns out to be easier to check irreducibility of f over Z and then to appeal 
to some general theory that we are about to develop. However, over Z 
we cannot pass automatically to monics, and we are confronted with the 
difficulty that 27? +2 is irreducible over Q but has the proper factorization 
2(x? +1) over Z. To isolate this problem we introduce the following concept, 
applicable more generally to any UFD that is not a field. 


Definition 1. The content c(f) of a polynomial f (over R) is the “greatest 
common divisor” (gcd) in R of the coefficients of f (cf. Remark 16.25). 


Strictly speaking, the gcd is defined only up to associate; for example, 
if R= Z and f = 27° —- 227 +6, then c(f) = +2. We say f is primitive, 
and write c(f) = 1, if its coefficients are relatively prime, in the sense that 
no noninvertible element of R divides all the coefficients of f. Clearly, any 
monic polynomial is primitive. In general, we see 


Remark 2. If Ris a UFD, then any f in R[a] can be written as cf,, where 
c = c(f) and f; is a primitive polynomial. Conversely, if f = cf, with 
fi primitive, then c = c(f). (Alternatively, if r € R and r|f in R[x], then 
rlc(f) in R.) 
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Remark 3. If f = gh in R[x], then f(0) = g(0)h(0), and the leading coeffi- 
cient of f is the product of the leading coefficients of g and h. 


Our main goal is to show in general that factorization in R[z] boils 
down to factorization in R coupled with factorization into primitive poly- 
nomials. We need a structural preliminary. 


Remark 4. There is a natural homomorphism y: R[x] > (R/Rp)[z], given 
by ieat 4% Ye; + Rp)2’. (Indeed, let 4: R — (R/Rp)[z] be defined 
as the composition of the natural homomorphism R > R/Rp with the 
natural injection R/Rp > (R/Rp)[z]. By Lemma 18.3, 4 extends to a 
homomorphism yg: R[x] > (R/Rp)[z] satisfying p(«) = x.) 

Now kerg = {Yo ejr" : each c; € Rp} = R{[z]p; thus, R[z]/R[x]p & 
(R/Rp)[x], by the first Noether isomorphism theorem. 


LEMMA 5. If R is an integral domain, then any prime p of R is also a prime 
of Ria]. 
Proof. By Proposition 17.6, we need to show that if R/Rp is an integral do- 


main, then so is R[z]/R[x]p. But (R/Rp)|[2] is certainly an integral domain, 
which is isomorphic to R[2]/R[z]p by Remark 4. 0 
Polynomials over UF Ds 


The groundwork is now laid for a major result. Throughout, we assume 


that R is a UFD. 


LEMMA 6. (Gauss’s lemma.) If f,g are primitive polynomials over R, then 
fg is also primitive over R. 


Proof. Otherwise c(fg) has some irreducible factor p, so p|fg. But p is 
prime, by Remark 16.25. By Lemma 5 we see p|f or plg, t.e., ple(f) or 
plc(g), contrary to f and g primitive. 0 


COROLLARY 6. e( fg) =c(f)c(g). 


Proof. Let c = e(f) and d = e(g). Then f = cf, and g = dg, for suitable 
primitive polynomials f1,91; thus, fg = (ed) fig: implying cd = c(fg) by 
Remark 2. 0 


CoROLLARY 6". Suppose f,g € Riz], and write f = c(f) fo and g = c(g)go. 
Then flg, iff c(f)|e(g) in R and folgo in R[x]. 


Proof. (<) is clear. (=) Write g = hf. Then e(g) = c(h)e(f), and canceling 
the contents from each side yields go = ho fo. 0 


To proceed further we pass to the field of fractions F of the integral 
domain R, by means of the following observation. 


LEMMA 7. For any f in F[2] there is some s #0 in R for which sf € R[z]. 
Proof. Write f = ee Six’ where c;,d; € Rand d; #0. Take s = dy... dk: 


1=0 d; 
evidently all the coefficients of sf are in R.O 
THEOREM 8. Suppose R is a UFD whose field of fractions is F. If f = gh 
for f,g € R[x] with g primitive and h in F[z], then h € R[z]. 


Proof. sh € R[x] for some s in R, and 
se(f) = elsf) = elgsh) = o(g)e(sh) = e(sh), 


since g is primitive. Thus, s|c(sh) so each coefficient of sh is divisible by s, 
implying h € R[x].0 


COROLLARY 9. Suppose R is a UFD, and f € R[z] is primitive. Then f is 
irreducible in F [x] iff f is irreducible in R[x]. 


Proof. (=>) If f = gh in R[z] then g or h is constant; assuming g is constant 
we see g divides c(f) = 1. 

(<) Suppose f = gh in F[z]. Taking s in R for which sg € R[x] we 
can write sg = cig, where c, € Rand g; € R[x] is primitive; then f = gihy 
where hy = s-'eyh € F[z]. But then hy is in R[x] by Theorem 8, and is 
primitive, by corollary 6’. Hence g; or hy is invertible and thus constant, 
implying that g or h is constant. O 


Note 9'. Actually we showed that for f € R[a], any factorization f = gh 
in F[a] has a factorization f = g,hy in Riz], with g; primitive and equiva- 
lent to g in F[z].It follows at once by induction that any factorization of f 
in F[a] has an equivalent factorization with each factor in R[z]. 


Example 9". Let us pause for a moment to consider rational roots of poly- 
nomials. Suppose @ is a root of a polynomial f € Q[2], with m,n relatively 
prime. Then nx — m divides f over Q, and thus over Z. But nx — m is a 
primitive polynomial, implying n divides the leading coefficient of f, and 
m|f(0), cf Remark 3. In particular, this gives us a finite procedure for 
finding all rational roots of f. 


Theorem 8 implies that the irreducible elements of R[a] are precisely 
the irreducible primitive polynomials and the irreducible constants in R. 
Although not the focus of the present discussion, the following result is now 
rather straightforward. 


THEOREM 10. If R is a UFD, then R[x] is a UFD. 


Proof. The existence of a factorization for any f in R[z] is clear: Writing 
f=c(f)fo in R[x] with fo primitive, we factor c(f) into irreducibles in R, 


and, by note 9’, we can factor fo into irreducible polynomials in R[x] which, 
in view of Corollary 6’, are primitive. In view of Proposition 16.20 it remains 
to show that this is a factorization into primes. On the one hand, any 
irreducible element of R is prime in FR and thus prime in R[z], by Lemma 5. 
On the other hand, any primitive irreducible polynomial is irreducible in 
F|a] and thus is prime. 0 


For example, 62? — 6 factors in Z[z] as 2-3- (#2 — 1)(x +1), where 
the only possible modification is multiplication by +1. Theorem 10 can be 
pushed a bit further, cf. Exercise 3. Applications are given in Exercises 7ff. 

As mentioned earlier, Noetherian rings have replaced unique factoriza- 
tion domains as the focus of commutative ring theory, and the analog of 
Theorem 10 for Noetherian rings (cf. Exercise 17.21) is much more impor- 
tant in current research. 


Eisenstein’s Criterion 

By Corollary 9, irreducibility of polynomials over Z leads to irreducibility 
over Q. However, we do not yet have general techniques to establish irre- 
ducibility of polynomials of degree > 4. An easy structural argument yields 
an infinite class of irreducible polynomials over Q, of arbitrary degree. 


THEOREM 11. (Eisenstein’s Criterion.) Suppose f € Zz]. If some prime 
p in Z does not divide the leading coefficient of f but divides all other 
coefficients of f, and if p? { f(0), then f is irreducible. 


Proof. Otherwise factor f = gh, where deg g = m > 0 and degh =n > 0. 
Then deg f = m+n. Writing” for the image in Z,[z] obtained by taking 
each coefficient mod p (ef. Remark 4), we have 


are’t? = f=gh. 
where a is invertible in Z,. Then 
m+n= deg g+degh < deg g+degh=m-+n: 


thus equality holds at each stage, implying degg = m and degh = n. 
Thus, by unique factorization of the polynomial xt” over the field Z,, 
we have g = ux” and h = vx" for suitable u,v 4 0 in Zp. But this 
means g(0) = 0 = h(0), so p divides g(0) and h(0), and thus p? divides 
g(0)h(0) = f(0), contradiction. Hence f is indeed irreducible. 0 

Note that this criterion does not make sense in Z, and is useless there; 


for example, 2? + 3 = (x — 2)(x% +2) in Zz. Thus, in the remainder of this 
discussion, we restrict our attention to polynomials over Q. 


Example 12. (i) For any prime p in Z and any ¢ > 0, the polynomials xt —p 
and «* + p are irreducible. 

(ii) Of course, 2? — p? = (41 +>p)(z —p) is reducible; likewise x' — p! is 
reducible for each t. 

(iii) Although it does not satisfy Eisenstein’s criterion, x? +p? is irre- 
ducible in Q[z], since it has no roots in Q. What about 2* + p?? Let us 
turn to Z[:] for help. 24 +4 = (x? — 2i)(2? +22). But 27 = (1+7)*, and 
—2i = (i(1 + 7))? = (—1+7)", so 


ti+4= (24+ (14+i))(z2 — (1+7%))(2 + (-142))(2 — (-142)) 
=(n+1+i)(2+1-i)(2 —14%)(2-1-i) 
= ((v +1)? +.1)((@ — 1)? +1) = (2? +. 22 4 2) (2? — 22 4-2). 


This factorization exists over Q because of the fluke that 27 is a square 
in Q[i]. A similar analysis for p £ 2 actually shows that 2*+p? is irreducible 
for all p #2 (cf. Exercise 2). 


Our next application is quite cute. We saw that every primitive nth 
root p of 1 satisfies the polynomial #” — 1. But 2” — 1 = (a —1)(2™7t+ 
-+++4+ 1) is reducible, and we should like to find some irreducible factor that 
p satisfies. The obvious candidate works, for n prime. 


THEOREM 13. For p prime the polynomial f = 2°71 + 2?-?7 +---+1 is 
irreducible. 


Proof. We would like to be able to apply Eisenstein’s criterion. If f = gh 
then f(2@+1) = g(#+1)h(2 +1), as seen by the substitution lemma (18.2). 
Thus it suffices to prove f(z +1) is irreducible. But (x — 1) f(z) = 2? - 1. 
Applying 2 «+1 yields 


Pp 


af(@+1) = (#41)? -1L=2?4+ px?" 4 (Byer tne 
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Pp 


f(e+l= P14 pgPo? + @ P34... 4p. 


To prove Eisenstein is applicable it suffices merely to show p\(®) for all 
1<i<_p. But this is obvious since in (?) = em the prime p appears 
in the factorization of the numerator but not in the factorization of the 
denominator. 0 


Here is another instance of the same trick. 


Example 14. The polynomial f = 82° —6z7-+1 is irreducible over Q. Indeed 


f(a+ 1) = 8(2 +1)? — 6(2 + 1)? +1 = 82? + 182? + 12243 


is irreducible, by Eisenstein. Likewise g = 82° — 6x? — 1 is irreducible. 


(Why?) A less elegant way to check that these polynomials are irreducible 


would be to show they have no rational roots, by means of Example 9”. 


Exercises 


Le 


Suppose f € R[s]. If z is a nonreal root of f, then (a — z)(#— Zz) = 
x? — (z+ 2Z)x + 22 is an irreducible factor of f in R[z]. 

Let f = 24+ p? for p > 2. Factor f completely over C, and match- 
ing complex conjugates, factor f as a product of two irreducible 
quadratics over R. Since these factors are not in Z[z], conclude that 


f is irreducible over Q. 


Nagata’s Theorem and Its Applications 

(Nagata’s Theorem.) Suppose FR is an integral domain, and S is a 
multiplicative subset consisting of products of prime elements, such 
that R[S~'] is a UFD. If every element of R can be written as a 
finite product of irreducible elements then R is a UFD. (Hint: An 
irreducible of R either divides an element of S and so is prime, or 
does not and stays irreducible in S~!R and thus is prime.) 
Reprove Theorem 10 using Nagata’s Theorem. 

The “circle ring” F[z, y,2]/{x? + y? + 2? — 1) is a UFD. (Hint: 
Noting 7? +y? = (1+ z)(1—z), localize at 1—z and apply Nagata’s 
theorem.) 

State and prove Eisenstein’s criterion for an arbitrary UFD. 


The Ring Z[z,...,2,] and the Generic Method 

Suppose C' is a UFD. Then the polynomial ring C[21,...,2,] in 
n indeterminates is also a UFD; likewise, the polynomial ring in an 
infinite number of commutative indeterminates over C is a UFD. 
(Substitution in several indeterminates.) Prove, for any commuta- 
tive ring C' and elements {e1,...,¢n} in C, that there is a unique 
ring homomorphism f: Z[x1,...,2,] 3 C, such that f(2;) = ¢; for 
L<i<n. 

Over Z[a41,...%,] the determinant d of the matrix 


1 1 ate - db 


10. 


LH: 


equals TT, <j; 2;<,(#i — 2). (Hint: Subtracting the jth row from the 
ith row of A yields the row 


2 n 
Li Ly Ly Ly et Di, — Dy, 


each term of which is divisible by (2; —2;). By unique factorization, 


I] @-2alé 


1<j<i<n 


prove equality by matching degrees and the coefficient of the mono- 


mial ori}... 2?_,.) 
(The Vandermonde Determinant.) For any commutative ring C, 
and any c1,...,€n, in C, the matrix 

1 1 1 

C1 C2 en 

n-1 n-1 n—-1 

Cy C5 ce 


has determinant --.¢,(c; — ¢;). (Hint: Apply exercise 8 to ex- 
1<j<i<n ] 

ercise 9.) 

(Symmetric polynomials.) Any permutation m of {1 2 ... n} in- 

duces a homomorphism 9,:Z/r,,...2,] 3 Zl[x1,...2,] given b 
Pp p ’ ’ & y 

Pr(X;) = Bei. We say f € Zlxy,...2%,] is a symmetric polynomial if 

pn(f) = f for every permutation 7. For example, one can write 


(2 —21)...(%— 2,) = 2" — 8,2" 1 4 892"? —---+ts, 
where s; are the following symmetric polynomials: 


8) =X +++ +2n, 


82 = 1% 4+71%3 +273 4+..., 


Sy Ta eet 


81,....5n are called the elementary symmetric polynomials in n in- 
determinates. Prove that any symmetric polynomial f can be writ- 
ten f = h(s1,...,8n), for a suitable polynomial h. (Hint: Induction 
on n and on the total degree of f. Write f for f(z1,....¢n—1.0) 
in Z[z1,...2n-1]. If f is symmetric, then f is symmetric in n — 1 
indeterminates, and in fact the elementary symmetric polynomials 


12; 


13. 


in n — 1 indeterminates are 3),...,3,-1. (Note §, = 0.) By induc- 
tion, f = 9(31....,3n—1) for suitable g. Then f — g(s1,...,8n—1) is 
divisible by z, and thus by each z;. Write f—g(s1,....8n—1) = 8nq 
and apply induction to q.) 

Given n, denote the symmetric polynomial 2 +2$+---+a! by fi. 
Verify the following formulas: 


2 z 2. 1¢8 . 
fo = 81 — 280: fs = 8| — 351582 + 383; 


fa= st _ 47s» + 48183 + 285 — 484. 


More generally, prove the following recursive formulas, known as 
Newton's formulas: 


fe =8ifh—1 — 82 fh—2 Fe + (SA) sp aFi oh (—1)**1ks;, 
(for k <n); 
tock Sst eee ts ee Ey eats 


(Hint: Working in F[x1,...,2,], take another indeterminate y and 
let g = []L, 0 - ay) = SSp_,(—1)* sey”. The formal logarithmic 
derivative developed as a formal power series yields 


n 


de SS enloay) Saar 
k=0 


i=1 


cf. Exercises 16.8ff, 16.12, and 16.15. Thus, q! = — 3p, afrsiy”: 
match coefficients of y*~1.) 
If dis the Vandermonde determinant of exercise 10, prove d? is the 


determinant of the matrix 


n fi Livi OS geil 

fi fo mute, Sa 
B= : : b, : 

a re 


(Hint: |B| = |AA*| = |A|?.) Coupled with exercises 11,12, this gives 
an explicit computation of d? in terms of the elementary symmetric 
polynomials. 
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Review Exercises for Part II 

1) Prove or give a counterexample:The union of two ideals is an ideal. 
2) List the ideals of Z09 

3) Find an integral domain which is not a field, but which contains C. 
4) Find rings R, CR, CR, CQ. 

5) Prove or disprove: Z[1/16] = ( Z[1/12] ) [1/13]. 


6) Find (with proof) all the subrings of Z[1/16]. 


Tif p=a’ +b’ =c’ +d’ is prime with a,b,c,d EN and a<b and c<d, 
then a=c and b=d 


8) Define “prime” and “irreducible” (elements), and show that every prime element 
is irreducible. 


9) Give an example of a prime nonzero ideal which is not a maximal ideal. 

10) Show f = 8x° —6x° +1 is irreducible over Q, without using Eisenstein’s criterion. 
11) Give an example of a UFD which is not a PID. 

12) Show that 9 has two inequivalent factorizations in z[V10 . 


13) Prove that the ring of Gaussian integers Z[i] is a UFD. 


14) Prove that every natural prime number of the form 4k+1 is a sum of two squares 
15) State and prove Gauss’ lemma. 


16) Determine the primes of Z[i]. 


17) Prove the formula d)=n, where @ is the Euler function. 
d| 


18) In the ring Z[r] where r is a primitive cube root of 1, show that: (i) 1-1 is irreducible; 
(ii) 1-r is prime; (iii) 3 is reducible. 


19) Prove (1—r)...1—r"') =n, for any primitive nth root of 1. 


20) Factor the following polynomials over Q: 
x41 x8 42,07 44,05 46,x°4+8,0°49. 


PART IIT — FIELDS 


Our previous applications of abstract algebra have been to discover prop- 
erties of integers and rational numbers, using the minimum possible com- 
putation. We turn now to the main goal of this course, which is to study 
irrational numbers, in particular, roots of polynomial equations. Since this 
will involve some rather difficult and abstract theory, we pause to survey 
the history of (polynomial) equations, preceding the revolutionary work of 


Galois. 


Historical Background 

Solving equations has been one of the principal occupations of mathe- 
maticians since the time of the Pythagorean school, although their methods 
were based on geometric reasoning. The name algebra itself derives from 
the famous treatise on equations, called ” Al-jabr w’al mugabala,” written 
in 830 by Mohammed ibn Musa al-Khowarizmi. 

It is natural to start with polynomial equations whose coefficients are 
integers. The simplest case is the linear equation az + b = 0, which has the 
root 7 = -4 € Q. Note that any linear equation with coefficients in a field 
F has its root in F’, so we are led to consider equations with coefficients in a 
given field F. In general, since any root of a polynomial f(z) is also a root 
of the polynomial obtained by dividing through by the leading coefficient 
of f, we might as well assume from the onset that f is monic. 

The degree 2 case is not quite as trivial. Every high-school student has 
had to struggle with the “quadratic formula,” which says that the roots of 


the equation xz? + bz +c = 0 are eye In other words, any quadratic 


equation over a field can be “solved” by taking one square root. The well- 
known method of obtaining this solution is by “completing the square”; 
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b? —4e 
4 


b 
23> 


i.e., letting y = z+ 2, one sees y? = —e+ we = , so one solves by 
taking square roots. 
For perhaps a thousand years or more, one of the major research ques- 


tions in mathematics was to solve the general cubic equation 
ae +62? +cr+d=0. 
In analogy to the quadratic case, one can obtain the simpler equation 
y +pyt+q=0, 


by taking y= 2+ om but further reductions are rather tricky. The famous 
poet Omar Khayyam found a geometric solution for a wide range of cubic 
equations, but a general method was not discovered until the sixteenth 
century. In 1515, Scipione del Ferro apparently solved the cubic (for p > 0 
and q < 0) , but only disclosed his findings to a select group of pupils. 
Challenged by a former student of del Ferro, Niccolo Tartaglia discovered 
the solution to the cubic but also kept his results secret. He finally divulged 
them in confidence to Cardan, who published the general solution in his 
major work Ars Magna sive de Reglis Algebraicis (“The Great Art — or 
the Rules of Algebra”). Clearly, it is enough to give one root, for then one 
is left with solving the other two roots in a quadratic equation. The root 
of y? + py +q =0 given in Cardan’s book is 


af —4 PAP FON? 3 ag P\? | 4)? 
i a) eG) a AG) a) 
Note that for gq = 0, one gets y = 0; for p = 0, one gets y = — ¢/q, as one 
would expect. However, the conscientious reader will find some difficulties 
with this formula, since it often gives an unexpected or unrecognizable root. 
(Try solving 2° + 2 = 2 or 2° — 152 = 4.) 
The quartic equation was solved by Ferrari, a student of Cardan, using 
a method reminiscent of completing the square: To solve y*+py?+qy+r = 
0, one notes 
2,P 2 = py? 2 2 
(y +f 4u) =-qy-Tt+ (5) + Quy" + puta’, 
with u to be determined. Considering the terms in y and y? in the right- 
hand side, one sees this can be a square of a polynomial in y, if and only if 
it is the square of V2uy — an in which case we could then take square 
roots and solve. But for this to happen, we must have 


2 


zs EY a ee 
r+(5 Ep ire er 1.€., 


Sue + 8pu? + (2p? — 8r)u- g = 0, (1) 


which can be solved via Cardan’s formula for cubics. Thus the quartic is 
solved by means of (1), which is called the “resolvent cubic.” 

For some time, it was thought that an inductive procedure could be es- 
tablished to solve an equation of arbitrary degree n in terms of a “resolvent” 
polynomial of degree < n; in fact a number of papers proposing solutions 
to the quintic were submitted, but each of them was found eventually to 
contain some error or another. (Unfortunately, the resolvent polynomial of 
the quintic turns out to have degree 6.) Finally in 1799 Ruffini published 
a proof that there is no general formula to solve equations of degree 5. 
His proof required more than 500 pages, and was met with skepticism; in 
1824 the brilliant Norwegian mathematician Niels Henrik Abel settled the 
question (negatively) to everyone’s satisfaction, by finding certain clearcut 
algebraic criteria which would follow from a formula, but which are violated 
by the “generic” equation of degree > 5. 

In 1831, a year before being killed in a duel (at the age of 21), Evariste 
Galois carved his niche for mathematical posterity by sketching his theory 
of solving polynomial equations, thereby clarifying the formulas for poly- 
nomials of degree 3 and 4, as well as reproving the theorem of Ruffini and 
Abel.? 

One of the triumphs of early 20th-century-algebra was to explain Galois 
theory in terms of the theory of fields, groups, rings, and vector spaces. We 
start with a polynomial f over a field F and take a larger field K which 
contains a root of f. Thus the study of field extensions K D F is basic 
to the theory, and a superficial discussion of properties of vector spaces 
suffices to dispose of various famous geometric questions considered by the 
Greeks, as will be seen in Chapter 22. (However, to answer one of these 
questions we need to know that 7 is transcendental, and this deep theorem 
is deferred until Appendix A.) 

Our next step, in Chapter 23, is to utilize ring-theoretic tools from 
Part II, and consider a field & that contains all of the roots of f; the theory 
is applied to determine all finite fields (Chapter 24). The set of isomor- 
phisms from FE to F that fix F turns out to be a group, with respect to 
composition of functions, called the Galois group; its subgroups determine 


1Unfortunately, Galois’s memoir was rejected by the French Academy, but, fortu- 
nately, the prominent French mathematician Liouville went over the details of Galois’s 
memoir and arranged for it to be published posthumously in 1846. A fascinating account 
of this subject, including an analysis of all major contributions leading to Galois’s crown- 
ing achievement, including the material sketched here, can be found in Galois’ Theory 
of Algebraic Equations, by Jean-Pierre Tignol, Longman Scientific and Technical, Essex, 
England (1988). 


all fields between F and F, via the celebrated “Galois correspondence” of 
Chapter 25. A host of applications follows in Chapter 26, culminating in a 
determination in Chapter 27 of precisely which polynomials can be solved 
in terms of explicit formulas. In particular, one can show with a minimum 
of computation that every equation of degree 4 can be solved in such a 
way, whereas there are equations of degree 5 and larger that cannot be so 
solved. Appendix A contains a complete proof that a is transcendental, 
and Appendix B contains a brief discussion of how fields, polynomials, and 
other topics of this text generalize to noncommutative algebras. 


CHAPTER 21. FIELD EXTENSIONS: CREATING ROOTS OF POLYNOMIALS 


We start with a given base field F, which is to contain all the coeffi- 
cients of the equation to be studied, and build up from there. 


Definition 1. An F-field is a field K that contains F’; we also say that K is 
a field extension of F, denoted as K/F (not to be confused with the other 
uses of this notation). 


Example 2. Some examples of field extensions: R/Q, C/Q, and C/R. 


Actually, although K will almost always be a field, it could be any 
integral domain containing the field F, and we assume this throughout the 
discussion. (We shall need this greater generality for technical reasons, but 
at the end we reduce to the field extension case, in remark 13 (ii).) 


Definition 3. If a € K and f = Ylajx* € Flx], we write f(a) for the 
evaluation SJaj;a’ in K. Let Z, ={f € F[z] : f(a) = 0}, easily seen to be 
an ideal of F[z]; these f are called the polynomials satisfied by a. We say 
ais algebraic over F when ZT, 4 0; we say a is transcendental over F when 
Ta = 0. 


Whereas an algebraic number often can be described rather concretely, 
such as 2, it is not easy to pinpoint any single transcendental number. 
The most important examples of transcendental numbers (over Q) are 7 
and e (the base of the natural logarithms), although the proofs are rather 
involved, cf. Appendix A. The easiest explicit number to prove transcen- 
dental is given in Exercise 1, but by far the most straightforward way to 
prove that transcendental numbers exist is to show that “most” real num- 
bers are transcendental, by means of set theory (cf. Exercises 2ff)! 


Algebraic Elements 

For the remainder of this discussion, we turn to the algebraic case (Z, 4 0). 
Write (f) for the ideal F[2]f. Since F[z] is a PID, we see Z, = (fa) for a 
suitable polynomial f, 4 0. Actually we know already from Remark 16.10 
that if also Z, = (g) then g = af, for suitable a # 0 in F (since all 
invertible elements of F[z] are constants). In particular, dividing through 
by the leading coefficient we have a unique monic f,, called the minimal 
polynomial of a. Note that if a 4 0, then f, is nonconstant so deg f, > 1. 
If n = deg f, we say ais algebraic of degree n. 


Remark 4. deg fy is the least degree among all polynomials satisfied by a. 


Let us pause for one familiar example, now, and postpone the others 
until later. C is an R-field. Let 2 = Y—-1 €C. The minimal polynomial 
of iis 2? +1. Every element of C has the form 2+ yi for z,y in R. 


In this example, we realize at once that #?+1 is the minimal polynomial 
since its degree is only 2. In general, it is useful to have a structural criterion 
for the minimal polynomial. 


Remark 5. Suppose a € K. If f = gh € F[x] then f(a) = g(a)h(a) (as can 
be seen directly or by Remark 18.2). 


PROPOSITION 6. Suppose a € K, and f is a monic polynomial satisfied 
by a. Then f is the minimal polynomial of a, iff f is irreducible. 


Proof. (=) Suppose f = gh. Then 0 = f(a) = g(a)h(a) implying g(a) =0 
or h(a) = 0. But f has minimal degree in Z,, so either deg f = deg g or 
deg f = deg h; we conclude f is irreducible. 

(<=) Let f, be the minimal polynomial of a. Then f € (f,) so f = gfa 
for some gin F[z]. But deg f, > 1 so g must be constant, 7.e., f is associate 
to fa, implying f = fa, since both are monic. 0 


Given a field extension K/F, we want to compare the structure of the 
fields K and F. There are two main methods to do this: The observations 
of this chapter will be obtained by viewing K as a vector space over F; 
later, we shall obtain deeper results by means of ring homomorphisms. So 
first we review some linear algebra. 

A vector space over a field F is an Abelian group (V,+) endowed with 
scalar multiplication F x V — V satisfying the following axioms, for all 
a,;in F andv;inV: 


lv=v 
(ay + ag)u = ayv + agu 
a(vy + v2) = avy + avy 


(Q1Q2)v = ay (agv). 


A subset SC V is called linearly independent, or F-independent, if 
Ss ajs; =0 (for distinct s; € S) necessarily implies each a; = 0. A base of 
the vector space V is a linearly independent set B that spans V, 1.e., every 
element of V is a linear combination of elements of B. The number of 
elements of a base B is called the dimension of V, denoted here as [V : F], 
and is known to be independent of the particular choice of B. If B is an 
infinite set we write [V : F] = oo. 

Any ring K > F can be viewed as a vector space over the field F, with 
respect to the given addition and multiplication in K. (The vector space 
axioms then follow as a special case of the ring axioms). Nevertheless, we 
maintain our additional assumption that K is an integral domain, in order 
to avoid contending with situations such as in Exercise 18. 


Given a in K, let Fla] denote the image of F[z] in K, under the 
substitution 7 +> a, 7.€., 


t 
Fla] ={>_ aja’: tN, a, € F}. 


i=0 


Clearly Fla] is an integral domain, being a subring of the integral domain 
K. Our immediate interest in F[a] is as a subspace of K as vector space 
over F. 


Remark 7. Fla] is spanned over F by B = {1,a4,a7,...}, so some subset 
of B must be a base. B is itself a base iff a is transcendental over F. 


PROPOSITION 8. Suppose a € K is algebraic of degree n over F. Then 
[Fla]: F] =n. In fact 1,a,a7,...,a"~1 are a base of Fla] over P. 


Proof. Independence of {1,a,a7,...,a"~1+} is clear, for if oe a;,a' = 0, 
then a satisfies the polynomial as a,x, which must be 0 since its degree 
is less than that of f,; hence each a; = 0. 

It remains to show that 1,a,...,a"~1 span Ffa], i.e., that F[a] = V 
where V = wea Fa‘. We must show a typical clement b = 0", aja’ 
of F[a] belongs to V. This follows from the Euclidean algorithm for F'[z]. 
Indeed b = g(a), for g = 77, az". Writing g(x) = q(x) fa(x)+7(2), where 
degr < deg fy or r = 0, we see that r(a) = g(a) = b, but by inspection 
ra)eEV.O 


The converse is also true. 
Corotiary 9. If [Fla]: F] =n, then a is algebraic of degree n over F. 


Proof. 1,a,...,a@” are dependent over F (since these are n +1 elements); 
thus 37", a,a° = 0 for suitable a; in F, not all zero, implying a satisfies 
the polynomial f = S77, aixz'. Now Proposition 8 implies [F[a] : F] is the 
degree of a. 0 


Actually, it is important to know that Fla] is always a field whenever a 
is algebraic. This is seen by a surprising application of the structure theory. 


Remark 9. If f 4 0 is an irreducible polynomial of F[z], then (f) is a 
maximal ideal of the principal ideal domain F[z], so F[z]/(f) is a field. 


THEOREM 10. Suppose a € K 1s algebraic over F. Then 
(i) Fla]/Z, Fla], under the substitution homomorphism (x + a); 
and 


(ii) Fla] is a field. 


Proof. (i) The substitution homomorphism ¢,: F[z] > Fla] is onto, with 
kernel T,, 80 ta: F[a2]/Z. 4 Fla] is the desired isomorphism, by Noether I. 

(ii) Za = (fa), and f, is irreducible by Proposition 6. By Remark 9’, 
F\a]/Z,, is a field, which by (i) is isomorphic to F[a]. 0 


Summary 11. The following statements are equivalent, for any element 
ain K: 

(i) a is algebraic over F; 

(ii) [Fla] : F] < co; and 

(iii) @ is contained in some field that is finite dimensional over F. (In 
fact this field could be taken to be F[a].) 


Let us look at some familiar examples. Although one can demonstrate 
directly that F[a] is a field, direct computations sometimes would be quite 
intricate. 


Example 12. (i) The special case a € F. Then Fla] = F, so [Fla]: F] = 1, 
the base being {1}. Note that f,, = 2 — a is linear. In general, [K : F] = 1 
iff K = F. This example is rather trivial. 

(ii) C = R[#], where i = —T satisfies the polynomial x?+1. [C : R] = 2, 
the base being {1,7}. 

(iii) a = V2 satisfies the polynomial «?—2, which is irreducible and thus 
must be the minimal polynomial of a, implying Q[V2] is two-dimensional 
over Q, with base 1, V2. 

(iv) In general, we say K/F is a quadratic extension if [K : F] = 2. If 
a € F is not a square in F then F[,/a]/F is a quadratic extension, with 
base {1, fa}. and the minimal polynomial of a is 2? — a. Conversely, 
suppose F is a field in which 1 +1 4 0. We claim that any quadratic 
extension K/F can be written K = F[,/a] for suitable a in F. This is 
seen by the familiar argument from high school of “completing the square.” 
Indeed, take any a in K \ F. Then 1 < [Fla]: F] < [Kk : F] = 2, implying 
[Fla] : F] = 2; hence Fla] = K, and the minimal polynomial of a has the 
form x? +a,2 +a 9. Thus a? + aya +a = 0, implying 
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(a+ Sy =a? +004 =-a+ Se F. 


Thus, letting a = —ao + a we see K = F[,/a]. Note that we required 


1+1 4 0in order that $ make sense; this idea will be pursued in Chapter 23 
when we study the characteristic of a field. 


(v) Generalizing (iii) in another direction, we see that the polynomial 
xz" —»p is irreducible over Q for any prime number p, by Eisenstein’s criterion, 
implying Q[ ¥/p] is an n-dimensional field extension of Q. 

(vi) Let p be a primitive cube root of 1. Then the minimal polynomial 
of pis 27 +2”+1. (Indeed 0= p? —1=(p—1)(p? + p41), implying that 
p satisfies the polynomial 2? -+2+1, which is irreducible by Theorem 20.13. 
Clearly, [Q[p] : Q] = 2, and in fact Q[p] = Q+ Qp.) 

(vii) More generally, if p is a primitive pth root of 1, for p prime, then 
[Q[p] : Q| = p — 1; proof as in (vi). (The nonprime case will be handled in 
Example 25.12.) 


Note that the cases [K : F] = 1,2 have been described completely in 
(i) and (iv), except when 1+ 1=0 in F. 


Finite Field Extensions 

Next we consider all the elements of K at once. We say K/F is algebraic if 
each element of K is algebraic over F; also, K/F is finite if [K : F] < oo. 
(Of course, K/F can be finite even if F and K both are infinite as sets; 
e.g., C/R is finite, since it is quadratic.) 


Remark 13. (i) Every finite extension is algebraic. More precisely, if 
[K : F] = n, then each element a of K has degree < n over F, since 
[Fla]: F] < [Kk : Fl] =n. 

(ii) If the integral domain K is algebraic over a field F, then K is itself 
a field. (Indeed, by Theorem 10, Fla] is a field for any a 4 0 in K, so 
a' € Fla] C K.) 


THEOREM 14. If F C K C L are fields, then [L: F] =[L: K][|K: F]. 


Proof. Let {a;: 1 € I} and {b; : 7 € J} be respective bases of K over F 
and of LE over K. We shall show that {a,b; :7€ I, 7 € J} is a base of L 
over F; the result then follows at once. Clearly, 


L= S> Kb; = So) Fai)b; = S> Faibj. 


JES jet iel tJ 


so the a,b; span L over F. It remains to show the a,b; are F-independent. 
If 37 ,,; 2ijaibj = 0, then rewriting this as }7,(}7; aijai)bj = 0 we see each 
2, aja; = 0, since the b; are K-independent. But by assumption the a; 
are F-independent, so each a;; = 0. 0 


This basic result has many consequences, some of which are given in 
Exercises 10 ff. 


COROLLARY 15. Notation as above, [K : F] divides [L: F]; in particular, 
if [L: F] is prime, then K = L or K = F. 


Proof. The first assertion follows at once from the theorem, and the second 
assertion is an obvious consequence. 0 


Given a field extension L/F and a,b € EL algebraic over F, we define 
Fla,b| = (F[a])[b]. Note that K = F[a] is a field, and thus so is Fla, }]. 
Furthermore the minimal polynomial of b over K divides the minimal poly- 
nomial f, of 6 over F, so [Fla,b] : K] < deg fy = deg b. Consequently 


[F[a,b]: F] = [Fla, 6]: K][K : F] < deg b deg a. 


One can iterate this procedure, to define Fla1,...,a:] for any t, whenever 
@1,..., 4 are algebraic over F. (Also see Exercise 10.) 


COROLLARY 16. If a and 6 are algebraic elements of an F-field K, then 
a+b, ab, and ¢ (for b #0) are algebraic. 


Proof. All of these elements are in Fla, 6], which is finite over F and thus 
algebraic over F. O 


COROLLARY 17. For any F-field K, the set of elements of K algebraic 
over F is a field, called the algebraic closure of F in K. 


Occasionally we shall need the following generalization of F[a, b]. Given 
subfields K, £ of a field W, we define the compositum KL to be the inter- 
section of all subfields of W containing both K and L. 


PROPOSITION 18. Suppose F C K,L C W are fields, with [K : F] and 
[L: F] finite. Then KL = {)7,,4. 410i a; € K, b; € L}. 


Proof. Let E = {\l¢,i4,. abi 1a; € K, 6; € L}. Any ring that contains both 
K and FE clearly contains Z, so EZ C KL. But [EF : F] is finite, since one 
gets a finite spanning set by multiplying a base of K with a base of DL, as 
in the proof of Theorem 14. Hence F is a field by Remark 13 (ii), implying 
KL=E.0 


Remark 19. Obviously [KE : L] < [K : F], since any base of K over F 
also spans KL over L. In Exercise 25.2 we shall see that [AKL : L] divides 
[Kk : F. 


Remark 20. Of course, this construction could be generalized to the com- 
positum of any number of subfields; and if K,,..., A; are subfields of W all 
finite dimensional over F, then Ky... Kt = {(Slanite Mis «+ Gi, 2 Gi; © Kj}. 


Exercises 


1. Any real number between 0 and 1 can be described as a string of 
digits following a decimal point. Define Liouville’s number 


a = .110001000000000000000001 00000000000000000... , 


which is given by 1 in the n!-position (for each n) and 0 elsewhere. 
Determine the only possible nonzero positions in a™ for each m, 
and by showing that these do not match, conclude that a is tran- 
scendental. A concise proof using calculus is given in Appendix A. 


Countability and Transcendental Numbers 
Recall that two sets have the same cardinality iff there is a 1:1 
correspondence between them, and a set S is called countable if it 
is in 1:1 correspondence with N. (In other words one can list the 
elements of S as {81,$2,...}.) Sis called uncountable if S is not 
countable. 

2. The union of a countable number of countable sets is countable. 


(Hint: Tf Sy = {81u, S2u,...} and S =US, then 
S={su, S21, $12, $22, 531, $22, Sigs ahs ) 


3. The set of complex numbers that are algebraic over Q is count- 
able. (Hint: Since any polynomial has a finite number of zeros, it is 
enough to show that there are a countable number of polynomials 
over Z. For any polynomial f, define its “index” to be deg f plus 
the sum of the absolute values of the coefficients. There are only 
a finite number of polynomials with a given index, so the set of 
polynomials is countable.) 

4. (Kantor’s Diagonalization Trick) The set S of real numbers between 
0 and 1is uncountable. (Hint: Any real number between 0 and 1 can 
be written in the form .djd2d3 ... where each d; € {0,1,...,9}. Sup- 
pose S were countable, and write the nth real number as .dyydon.... 


Construct a number .djd)... that formally does not appear on the 
list, a.e., di A di; for each 7. (One has to be a bit careful, since, for 
example, .19999--- = .20000...) 


5. There are uncountably many transcendental real numbers between 
0 and 1, by Exercises 3 and 4. 

6. Generalizing the argument of Exercise 3, show that if F C K are 
fields with F infinite, then F has the same cardinality as its alge- 
braic closure in K. 


10. 


11. 


12: 


13. 


14. 


15. 


16. 
17. 


18. 


Algebraic Extensions and Elements 

Give an example of an algebraic extension that is not finite. 

Using the fact that every polynomial over C has a root (to be proved 
in Chapter 26), prove that C has no proper algebraic field exten- 
sions. 

Find a proper field extension of C, bearing in mind that it can- 
not be algebraic, by Exercise 8. (Hint: C[x] is an integral domain 
containing C.) 

Show that F[a, 6] = (F[6])[a]. Prove that ifm = deg a and n = deg b 
are relatively prime then Fla, 0] is a field of dimension mn over F. 
(Hint: F[a,b] contains both Fla] and F[b], and thus [Fla,}] : F] is 
divisible both by m and n.) 

V3 + V7 has degree 4 over Q. (Hint: [Q[V3. V7] : Q|) = 4. But 
V3 4 V7 is not quadratic over Q.) For a more systematic method 
of computation, see Exercise 15. 

[Ql /pr----. pe] : Q) = 2°, for distinct positive prime numbers 
Pls+++sPt- 

Suppose a is a root of the polynomial yy mx" for m; in Z. Then 
m,za is a root of a suitable monic polynomial (over Z) of degree t. 
(Hint: 0 = mi7! 3 mat = (mea)* + me_1 (mia)? t+...) 
Suppose V is a vector space of dimension n over a field F. Write 
Homp(V,V) for the set of linear transformations of V, taken natu- 
rally as a subring of Hom(V,V) (viewing V as an Abelian group). 
Taking a base {b;,...,6,} of V over F, show that any linear trans- 
formation T:V — V over F is defined uniquely by the action on 
the base bj + DYi_, ajjbj, where each a;; € F. Thus T ++ (aj;) 
defines a 1:1 correspondence Homg(V,V) > M,(F), which is an 
“anti-isomorphism” of rings — it preserves addition but reverses 
the order of multiplication. 

(The regular representation revisited, cf. Exercise 14.4) Suppose 
Ais an algebra of dimension n over a field F. The composite sending 
a to the right multiplication map and then to the corresponding 
matrix yields an injection of A into M,,(F). (Hint: Multiplication 
is reversed twice.) 

View C explicitly as a subfield of M2(R). 

Reprove Remark 13 by means of Exercise 15 and the characteristic 
polynomial. This proof is “constructive,” in the sense that one 
can actually compute a polynomial satisfied by an element; as an 
application, solve Exercise 11 by this method. 

The direct product R = CxC contains the subfield {(a,a) : a € C} 


which is a field isomorphic to C. Show that [R: C) = 2; why does 
this not contradict the fact that C is algebraically closed? 


CHAPTER 22. THE PROBLEMS OF ANTIQUITY 


In the eyes of the ancient Greeks, there was a deep connection between 
mathematics and life, and numbers entered into every facet of philoso- 
phy, science, and art, such as the “golden ratio” (24%), an important 
guide in architecture. The only real numbers of interest to them could 
be constructed, using a compass and straight edge, by means of certain 


well-defined rules, which in modern terms could be described as follows: 
Construction by Straight Edge and Compass 


Definition 1. We work in the Euclidean plane, and construct real numbers 
in a sequence of steps. We start at Step 1 with two points which we call 0 
and i, and with the line Lo defined by these two points (which we identify 
as the “real line”), but with no other lines or circles. By convention we 
identify Lo with the X-axis in the plane, and we shall define any real 
number a in terms of its corresponding point @ on Lg. Thus the numbers 
0 and 1 are defined by the points 0 and 7. 

Inductively, suppose we have defined various lines, circles, points, and 
numbers at Step i— 1. In addition to carrying these over to Step 2, we pro- 
ceed to define new lines, circles, points, and numbers at Step 2, as follows: 


1. A point is defined at Step z if it lies in the intersection of a line and 
a line, a line and a circle, or a circle and a circle, which already have 
been defined at Step 7 — 1. (Note that the intersection of the circle 
with a line or another circle will normally define two points, so there 
may be acertain ambiguity in describing these constructions. There 
are various ways of avoiding this ambiguity, but they are irrelevant 
to the present discussion.) 

2. A number a is defined at Step z if the point @ has been defined at 
Step 2. 

3. A line is defined at Step z if it passes through two points defined at 
Step z. The line defined by points P and Q will be denoted L(P, Q). 

4. A circle is defined at Step 2 if its center is a point defined at Step 2 
and if its radius is the distance between two points defined at Step 2. 
The circle defined by center P and radius r will be denoted C(P;7). 
(Intuitively, we construct the circle by opening the compass to the 
distance between the two given points, and then placing the center 
of the compass at the prescribed center of the circle.) Although we 
do not specify a priori that the radius be defined, it turns out to be 
a constructible number, by Remark 2 below. 


A number that is defined at some step is called a constructible number. 
Typeset by A,yS-TEX 
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Thus, at Step 1 we may define the two circles C(0;1) and C(i;1). 
These intersect Lo also at the points —i and 2 respectively, which thus are 
defined at Step 2. Hence the numbers —1 and 2 are defined at Step 2. 

The two points of intersection of C(0:1) and C(i:1) are defined at 
Step 2, and thus the line L, connecting these two points is defined at Step 
2. Hence the point 11,9 Lo = + is defined at Step 3, thereby yielding the 
number >. the first noninteger to be constructed. 


Remark 2. The radius r of a definable circle is a constructible number; 
indeed, # € C(0;r) N Lo. 


We shall use the following familiar constructions: 


(1) A perpendicular to a given line, passing through a defined point off 
the line; 

(2) A perpendicular to a given line, passing through a defined point on 
the line; and 

(3) A line passing through a defined point P and parallel to a defined 
line LE. (Actually this is an immediate consequence of (1) and (2); 
we drop the perpendicular £, from P to DL, and then take another 
perpendicular from Ly at P.) 


Diagram 1 illustrates constructions (1) and (2) respectively. 


One question of utmost importance to the ancient Greeks was, “Which 
numbers are constructible?” More explicitly, they asked: 


1. “Doubling the cube.” Can one construct a cube having double the 
volume of a given cube? 

2. “Squaring the circle.” Can one construct a square of the same area 
as a given circle? 

3. “Trisecting an angle.” Can one construct an angle one-third as wide 
as a given angle? (There is a well-known procedure for bisecting an 
angle.) 


4. “Constructing the n-gon.” For which n can one construct a regular 
polygon of n sides, also called the regular n-gon? 


Our objective is to utilize the field theory developed so far, to handle 
these questions. Questions 1,2, and 3 will be shown to have negative an- 
swers, whereas the solution of question 4 is started in Exercises 8ff (although 
the full solution is given only in Chapter 26.) 


Remark 8. Let us rephrase these questions in terms of constructibility of 
numbers. 


1. If the edge of the given cube is 1, then the edge of the second cube 
is y 2, so we want to construct 2. 

2. If the radius of the circle is 1, then its area is 7, so we want to 
construct /7. 

3. If the given angle is 36 then we want to construct 6. 

Let us consider further the construction of angles. Putting the 
vertex of the angle at 0 and taking one of the sides of the angle to 
be Lo, we let P be the intersection of the other side with C(0;1). 
Dropping a perpendicular from P to Lo yields the number cos 6; 
conversely, given cos 6 we can reverse the steps, raising a perpen- 
dicular to the unit circle and thereby reconstructing the angle 6. 
In this way, constructing an angle is tantamount to constructing 
its cosine. Thus, our question is, “If cos(3@) is constructible then is 

3 
case of this question is whether cos 


cos 6 constructible?” Clearly, cos = = .5 is constructible, so a special 


9 
, since then we could subdivide the 


is constructible. 


4. We need to construct the angle == 
circle into n equal parts, thereby enabling us to inscribe the regular 
n-gon in the circle. As above, this is equivalent to constructing 
cos 2 although we shall find it much more instructive to pass to 


primitive roots of 1. 


Algebraic Description of Constructibility 

All of these questions thereby boil down to verifying whether or not 
certain explicit numbers are constructible, and we shall solve these problems 
by finding an algebraic criterion for a number to be constructible. It turns 
out that V2, Ja, and cos ¢ all fail this criterion, and so the answer to each 
of these questions is “No.” Our task is to relate constructibility to the 
previous material about fields. In order to apply that theory we consider 
all constructible numbers at once. 


PROPOSITION 4. The set C of constructible real numbers is a subfield of R, 
satisfying the property that if a € C then fa € C. 


Proof. First we verify the field axioms for C. If a,b € C, then b+a € C, 
seen by taking the intersections of C(b: a) with Lo. Thus, (C,+) is a group. 

Given a,b € C with a > 0 and b ¥ 0 it remains to construct ¢ and /a, 
which is done by means of Diagrams 2 and 3: 


Note. Although one could construct ./a rather easily by means of similar 
triangles, the construction here uses an elegant application of Pythagoras’ 
theorem applied to the right triangle CiP. Note that C corresponds to ite: 


the distance along the perpendicular down from P to tis 
= Te ta \? tea \? 
CP -Cl = 5 —{1- ; = Ja. 


(Note that we assumed a < 1, but this can always be done if we replace 
aby + whena> 1.) 0 


Let us call a set S of real numbers constructible if each element of S is 


constructible. 


CoroLiary 5. If K = F[Val] is a field, where F is constructible and 
0<a€ F, then K is also constructible. 


In view of this result, we would like to see what it means to adjoin the 
square root of an element to a field. In preparation for the main result, we 
say afield K C Cis quadratically defined if there is a chain of fields 


Q=KhCKHCHh:-CR=K (1) 


for suitable t, such that each [Fi41 : Fj] = 2. 

The next result could be viewed as an introduction to the deeper theory 
of Chapter 27. Recall the definition of the compositum KL from Proposi- 
tion 21.18. 


LEMMA 6. (i) If K and LE are quadratically defined fields then KL is a 
quadratically defined field. 

(ii) Any subfield L of a quadratically defined field K is quadratically 
defined. 


Proof. (i) If there are chains of fields 


Q=KhKCKCH:-Ckra=K, 
Q=Hh CH, CEH::-CE,=L, 


for suitable m,n, such that each [Fj41 : Fj] = 2 and each [B41 : Bj] = 2, 
then 


Q=hCRHCH::-CR,=KCKE, CKE,C::-CKE, =KL; 


each successive extension has degree < 2, by Remark 21.19. 

(ii) Take the chain (1). Letting F/ = F; L, we see (by extending a 
base of Fj to a base of F;) that [Fi,, : Fj] < [Fina : Fi] = 2, so the F! 
produce the desired chain from Q to ZL. 0 


THEOREM 7. The following assertions are equivalent for a real number a: 
(i) a is constructible; 
(ii) a is contained in a quadratically defined subfield of R; and 
(iii) Qla] itself is a quadratically defined subfield of R. 


Proof. (ii) => (iii) by Lemma 6(ii). 

(viz) => (i) Take a chain Q = Fy C F, C Fy--: C Fy = K, such that 
each [Fj41 : Fj] = 2. By Example 21.12(iv) we can write Fj4, = F;[,/a,] for 
a; € F;. Clearly Q is constructible, by Proposition 4. But by Corollary 5, 
if F; is constructible, then F;41 is constructible. Hence F; is constructible, 
by induction. 

(i) = (ii) We shall prove for any constructible point P, viewed in 
terms of its Cartesian coordinates (%p,yp), that there is a chain (1) such 
that sp,yp € F;. This is the same as proving the assertion of the theorem, 
since P is a constructible point iff sp and yp are constructible, cf. Exercise 
4. However, it is easier to set up the induction in this manner, on the Step 
i at which P is defined. Let S be the set of (X- and Y-) coordinates of the 
points defined before Step z. Since S is finite, we appeal to induction and 
Lemma 6 to obtain a chain 


Q=hHCRCHh::-CR=L 


for which S C EL, with each [Fj41 : F;] = 2. Thus, it suffices to prove that 
[L[zp.yp] : L] < 2 (since then K = [L[xzp, yp] : L] is quadratically defined.) 
This turns out to be an exercise in analytic geometry, as we shall see now. 
The equation of the line passing through points (21, y1) and (a2, y2) is 
y—Y1 = mM(a2—-— 21) where p= (2) 
by — Py 

and the equation of the circle with center (29, yo) and radius equal to the 

distance between points (x3, y3) and (x4, y4) is 


(a —29)°+(y— yo)? =s where s = (24-23)? + (ya — ys)”. (3) 


By definition, (xp,yp) is a simultaneous solution of two equations, each 
having the form (2) and (3), where all x;,y; € LD. 

We treat each case in turn. The intersection of two lines involves 
solving two simultaneous linear equations each of the form (2), and thus 
has its solution in L, see Exercise 6. 

The intersection of a circle and a line involves the solution of (2) and 
(3). But using (2) we could substitute m(z — 21) + y1 for y in (3), so 
(3) turns into a quadratic equation in x, which has its solution in some 
quadratic extension K of LE, and y € K by (2). 

Finally, the intersection of two circles involves solving (3) and 

(o— mh)? +(y- mh)? =s”, (4) 

But subtracting (4) from (3) cancels out the z? and y? terms and thus 
leaves a linear equation with coefficients in DL, so we continue as in the 
case of the intersection of a circle and a line. (Geometrically, our equation 
describes the line passing through the points of intersection of the two 
circles.) 0 


COROLLARY 8. If a is a constructible number, then a is algebraic and its 
degree is a power of 2. 


Proof. By the theorem QJa] is a quadratically defined field. Thus, applying 
theorem 21.14 to (1), we see [Qfa] : Q] = 2° for some #; hence deg a = 2'. 0 


Solution of the Problems of Antiquity 
We are now ready to solve problems 1,2, and 3 given before Remark 3. 


COROLLARY 9. (Compare with Remark 3.) The numbers V2, /7. and 


cos 7 are not constructible. 


Proof. We shall show that none of these are algebraic of degree a power 
of 2, and thus we shall be done. 


1. V2 has degree 3, cf. Example 21.12(v). 

2. Since ,/x is quadratic over 7, we may pass to 7, which is transcen- 
dental. However, this is a rather deep theorem, whose proof is given 
in Appendix A. 

3. Let us determine the minimal polynomial f of cos >. More generally, 
we want to determine cos@ in terms of cos 34. To streamline the 
computations we appeal to the mathematics of the complex unit 


circle; the point corresponding to the angle 6 is e* = cos@ +isin 0. 
Then 


ene 
cos 6+ isin 6)? 
cos 6)* + 3i(cos 8)” sin @ + 32” cos (sin 6)” + 7° (sin 6)? 


=A 
= 
= 
= (cos* @ — 3cos@ sin” 8) + i(3cos* 6 sin @ — sin® 6). 


Matching real parts and substituting 1 — cos? @ for sin” 6 yields 


cos 36 = cos? 6 — 3cos6@ sin? @ 


= 4cos*® 6 — 3cos 6 


Now putting 6 = $, we see cos 36 = cos = 3. so cos 6 is a root of the 


polynomial 42° — 32 — >. or equivalently, of f(x) = 82° — 6r — 1. But f is 


irreducible, by Example 20.14. 0 

Remark 10. The regular n-gon: Initial results. The same line of reasoning 
can be used to study the sonst of the regular n-gon, or equiva- 
lently, constructing the angle 2 =*. But interpreting the unit circle as in the 
complex plane, this is tantamount to constructing the point corresponding 
to a primitive nth root p, of 1. Although we have focused on constructible 
real numbers, one could just as easily have defined constructible complex 
numbers, by saying a+ bz is constructible if both a and 6 are constructible 
real numbers. The theory goes through in this greater generality, as is seen 
in Exercise 4; the proofs of Theorem 7 and Corollary 8 also apply to com- 
plex numbers, so p, can be constructed only if its minimal polynomial f,, 
has degree a power of 2. When n is prime, we have seen f, = 2" '+---+1, 
which has degree n — 1, son — 1 must be a power of 2. In particular, one 
cannot construct the regular 7-gon, 11-gon, or 13-gon. Furthermore, one 
cannot construct the regular 9-gon, in view of corollary 9(iii). Further in- 
sight is provided in Exercises 9ff, in which we see the regular 5-gon can 
be constructed. Thus, the smallest nontrivial case is n = 17(= viene 1). 


In a tour de force, the young Euler constructed the 17-gon; his result fol- 


lows from Theorem 26.7 below, which determines the constructibility of all 


regular n-gons, modulo a famous question in number theory. 


Exercises 


Constructibility 


1. 


10. 
Lt, 


Describe rigorously the construction of a perpendicular to a given 
line, at a given point on the line. 


. Describe rigorously the construction of a perpendicular to a given 


line, at a given point off the line. 

Define the Y-axis of the plane to be the line perpendicular to the X- 
axis at 0. (Thus (is the origin, and we have the axes of the Cartesian 
plane.) Written in Cartesian coordinates, the point P = (xp, yp) 
is defined iff zp and yp are constructible numbers. (This is seen by 
projecting down onto the X- and Y-axes.) 

The number a+ bi is quadratically defined iff the point (a,b) on the 
plane (with respect to the axes of Exercise 3) is constructible. (Hint: 
a,b are quadratically defined, as is i = /—1, so apply Lemma 6(i).) 
Describe rigorously the construction of the bisector of an angle. 
How can one infer the existence of the angle bisector using algebra 
instead of geometry? 

Solve explicitly equation (2) (from the proof of Theorem 7) and 


y-—y =m'(a2—-2') 


to get » = merrwd—(n W) € Land y EL. 


Suppose a is algebraic of degree 4. Show that a is constructible iff 
Qla] contains a proper F-subfield ¥ FP. 


Constructing a Regular n-gon 

If a regular p-gon can be constructed, for p prime, then p has the 
form 27 +1. (Hint: p = 2” +1 for some m. But if m has an 
odd factor k, then 2™/* + 1 divides p, a contradiction.) Construct 
a regular 3-gon. 

Any primitive fifth root p of 1 is constructible. (Hint: This is 
sometimes proved directly in geometry courses, but is not difficult to 


~! satisfies 7? +2 = 1. Conclude 


show algebraically; show that p+ p 
that the regular 5-gon is constructible. 

If the regular n-gon is constructible, then so is the regular 2n-gon. 
Determine the constructibility of all regular n-gons for n < 16. 
(Hint: No for n = 7,9,11, 13,14; yes for all others. Note, for exam- 


ple, that if p,, is a primitive n-th root of 1 then pis € Qlps,ps].) 


CHAPTER 23. ADJOINING ROOTS TO POLYNOMIALS: SPLITTING FIELDS 


In the previous sections we studied algebraic elements in terms of the poly- 
nomials that they satisfy. Now we reverse the question and ask, how can we 
determine the roots of a polynomial? This question has several possible in- 
terpretations, and there exists a highly developed theory of approximating 
roots of a polynomial in a given field. However, we are interested in an ex- 
act algebraic solution, and thus our first interpretation is, “Given f € F[z], 
can we find a field K that contains all the roots of f?” First we locate one 
root. To understand the question, let us examine a familiar situation. 

Suppose we met someone who does not know what the complex num- 
bers are. We might very possibly describe them as the field R[—1]. That 
person would then be looking for an R-field that contained an element 2 
satisfying i? = —1, or equivalently i? + 1 = 0; in other words he or she 
would want to adjoin a root of the polynomial xz? + 1 to the field R. There 
is a formal construction that works for any irreducible polynomial f. To 
obtain a root x of f we declare formally f(x) = 0; the easiest way to do 
this is by taking the polynomial ring F[2] and factoring out the ideal (f(z)) 
(thereby “forcing” f(z) to become 0). Thus, we shall be combining aspects 
of ring theory with the vector space applications used in Chapter 21, and 
to this end we need to reconsider the notion of inclusions of rings. 

What do we really mean when we say Z C Q? When defining the 
rational numbers, as in Chapter 15, we do not actually include Z itself, but 
rather the set {7 : € Z}, which is naturally identified with Z under the 
correspondence 7 +> n. In the language of algebra, Q contains a subring 
which is isomorphic to Z, or equivalently, there is a ring injection 7: Z > Q 
sending n> >. Thus, we could think of an F-field K as a field K together 
with a ring injection 4: F > K; then we identify F with its image 7)(F) in 
kK. We can weaken this formally, in view of the following straightforward 
but useful observation. 


Remark 1. If F is a field and w: F > R is a ring homomorphism, then 4 is 
an injection; indeed, ker is a proper ideal of F and thus is 0. 


The way has been cleared to introduce the following structure. 


Definition 2. Suppose F is a given field. A commutative ring R is called 
an F-algebra if there is a ring homomorphism 7: F > R. 


Then 7 will be an injection, by Remark 1, so we identify F with its 
image under y, called the “canonical image” of F; in this way we view F 
as a subfield of R. In particular, an F-algebra K that happens to be a field 
is an F-field. 
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Example 2'. Any subfield F of Cis a Qalgebra, and thus a Q-field. Indeed, 
we define the injection 4:Q > F by (2) = ml ef, Example 15.9. 


ni? 

By being precise, we have introduced an ambiguity: An F-algebra R 
could be an F-algebra in several different ways, depending on the choice 
of the homomorphism 7: F > R, cf. Exercise 1. Thus one should keep 
the particular injection 4 in mind. Nevertheless, in this treatment we shall 
tend to view F C K, and we shall downplay the role of y. This ambiguity 
motivates the development of the theory in Chapter 24. 

If Ri, Ry are F-algebras, then, viewing both F C R,; and F C Ra, 
we define an F-homomorphism to be a ring homomorphism y: R, > Re» 
that fixes F, ie, g(a) = a for all a in F. An F-isomorphism is an 
F-homomorphism that is an isomorphism. 

One very important example: F[z] is clearly an F-algebra, where the 
map w:F > F[z] sends an element a of F to the constant polynomial a. 
For any F-algebra R, one can extend 7 to the substitution homomor- 
phism 4: F[z] > R given by z » a, cf. Lemma 18.2, and this is an 
F-homomorphism. Note that the image of F[z] under 4), is Fla]. 

Here is another important example. 


PROPOSITION 3. Suppose F is a field, and f € Fa] is monic irreducible. 
Let L = F[z|/(f). Then L is an F-field, and (identifying F with its canon- 
ical image in L), L = F[Z] where & is the image of x in L. Furthermore, 
f is the minimal polynomial of % over F, and [L: F] = deg f. 


Proof. L is a field, by Remark 21.9’, so is an F-field. Let ~ denote the 
canonical homomorphism from F[z] onto L = F[a]/(f). Any element of L 
has the form S* ar? = S> a; 2", so, identifying F with F we have L = F[). 
Furthermore, if f = SSi_, Ba*, then 0 = f = , Biz’ = f(z); since 


f is irreducible we conclude from Proposition 21.6 that f is the minimal 


polynomial of %. Hence [EL : F] = deg f, by Proposition 21.8.0 


Note. In case we already have found an F-field K = Fla], where a is a root 
of f, then Fla] ~ F[a]/(f), by Theorem 21.10. For example, C = R{i] = 
R[z]/(2? +1). Let us continue this reasoning. 


PROPOSITION 4. Suppose K,, K> are F-fields, and a, € Ky, az € Ky are 
both roots of the same irreducible polynomial f € F|a]. Then there is an 
F-isomorphism Fla,] > Fla,] sending a1 + ap. 


Proof. Apply Theorem 21.10 twice (once in reverse) to get the isomorphism 
Pla] > Flz]/{f) > Fla2] sending a4 Hb eH a. O 


Sometimes both of these roots are in the same field. 


Example 5. (i) The roots of 2? + 1 in C are 47. Therefore, there is an 
R-isomorphism C > C given by i> —12; this is complex conjugation. 

(ii) the roots of 2? —2 in Q[Y2] are +2. Hence there is a Q isomor- 
phism Q[V2] > Q[V2] given by V2 - V2. 


Let us digress briefly, to obtain an important (although easy) converse. 


PROPOSITION 6. Suppose K is an F-field, f(x) € Flz], anda € K isa 
root of f. Then o(a) is also a root of f, for any F-isomorphism o: K > K. 


Proof. Write f = >> a;2x' for suitable a; in F. Then 


f(o(a)) = So ai0(a)' = So o(aajo(a’) = oS ava) 
=o(f(a))=o(0) =0. O 


Propositions 4 and 6 will lie at the heart of our theory, since they indi- 
cate the close connection between roots of a polynomial and isomorphisms 
of suitable field extensions. 


Splitting Fields 

We return to our main quest. Having found one root, we now want 
them all, in a suitable extension field K of F; we also want such K to be 
uniquely defined (up to F-isomorphism). This task might seem unmanage- 
able at the outset, but becomes much easier if we recall from Theorem 18.5 
that f can have at most n roots in K, where n = deg f. Thus, our strategy 
is to use Proposition 3 to adjoin one root at a time, until we are done (after 
at most n steps). Let us describe this procedure more precisely. 

We say a polynomial 0 4 f € Fla] splits over K D F if f can be 
written in K[z] in the form 


9(@—a1)...(@-—Gy), a; EK. 


(Here y is the leading coefficient of f and thus is in F; without loss of 
generality, we shall take f to be monic, so that 7 = 1.) 

This condition implies that a1,...,@, are roots of f and actually are 
alithe roots of f in any field extension of AK’, since any root 5 satisfies 


0= f(b) = 7(b— 41)... (b- an), 


implying 6 is one of the a;. 


Definition 7. A splitting field of a polynomial f (over the field F ) is a field 
E 2D F satisfying the following two conditions: 


(i) f splits over EB, and 
(ii) f does not split over K for any F C K C E. 


Remark 8. Another way of saying that F is a splitting field of a monic 
polynomial f is that EF = Flay,...,an], with f = (# — a1)...(a% — ay) 
in E [2]. Intuitively, F is the field we get by adjoining the roots of f. (In 
particular, [EZ : F] < co since E is obtained by adjoining at most n roots.) 


This point of view is very intuitive and enables us to work in any field £ 
over which f splits as f = (w—a,)...(—a,), since then Flay,...,@n] C LB. 
Then £ cannot contain any other splitting field of f; in fact f has no other 
roots in L, as noted above. (Most commonly, L = C.) Other immediate 
consequences of Remark 8: 


Remark 9. Suppose F is a splitting field of f over F. 

(Gi) If fF C K CE, then F is also a splitting field of f over K. 

(ii) Any factor g of f in F[z] also splits over #. (Indeed, by unique 
factorization in E [2x], the factorization of g is part of the factorization of f 
into linear factors.) 

Gi) If PF C K C Band f = (x-a)gin K[z], then EF is a splitting field 
of g over K. (Indeed, F is generated over K by the roots of g.) 

(iv) Conversely to (iii), if K = Fla] and f = (#—a)g, then any splitting 
field L of g over K is also a splitting field of f over F. (Indeed, writing 
g = (%@—-a2)...(2—a4) in K[z], we see L = K[ao,..., a+] = Fla,ae,...,a4].) 


Example 10. (based on Example 21.12.) In each case, FE will denote the 
splitting field of f over F. 

(i) If f =x2-—aforaé F, then H= F. 

(ii) C is the splitting field of x? +1 over R. 

(iii) C contains the splitting field of x? + 1 over Q, which is Q[i]. In 
general, if f € F[2] is irreducible with a root in K \ F with [K : F] = 2, 
then # = K. (This can be seen either via Example 21.12(iv) or as a special 
case of Theorem 11 below.) 

(iv) If f= 2" -—1 and F = Q, then EF = Qlp], where p is a primitive 
nth root of 1. Indeed, f = (2 — 1)(% — p)... (a — p71), by Proposition 
18.14. 

(v) If f= 2" 1+---+1 and F = Q, then, by Remark 9(iii), EF = Ql[p]. 
where p is a primitive nth root of 1; indeed, 2” — 1 = (x — 1)f. 

(vi) If f = 2” — p for p prime, and F = Q, then E = Ql xf, pl. 
where p is a primitive n-th root of 1. (Indeed, by Proposition 18.14, E = 
Qla, pa,....p"~'a], where a = 3/p. But then p = (pa)a~! € E, implying 
B= Qla, p].) 


If one wants to start with an arbitrary base field F and develop the 
theory from scratch, one must confront the following questions, for any 
polynomial f in F[z]: 

(1) Does f have splitting fields? 

(2) Is the splitting field unique? 

(3) How many distinct roots does f have in its splitting field (since 
perhaps a root repeats)? 


We start by answering question (1). 


THEOREM 11. Any polynomial f over F has a splitting field E for which 
[Ef : F] < n!, where n = deg f. 


Proof. By Proposition 3, we can find K D F containing a root a of f, 
with [kK : F] <n. But then f = (a — a)g in K[z], by Corollary 18.4, so 
deg g = n-—1. By induction g has a splitting field # over K, for which 
[E: K]) <(n-1)!,s0[B: F] =[E: K][K : F] < nt; Remark 9(iv) shows 
that & is the splitting field of f. 0 


Remark 12. In Theorem 11, if f is irreducible, then n|[Z : F]. (Indeed, if 
ais a root of f, then [Fla] : F] =n divides [F: F].) 


We can answer (2) by proceeding formally. Given two splitting fields 
Ei, FE, of f over F, we want an F-isomorphism £, > E.. 


Remark 13. Any F-homomorphism ©: F, > EF» of splitting fields (of f) is 
an F-isomorphism. (Indeed © is 1:1 by Remark 1, implying ®(£,) is an 
F-subfield of E, over which f splits, implying @(£,) = FE». 


Thus our quest for uniqueness reduces to locating an F-homomorphism. 
We get this by lifting F-homomorphisms step by step. Suppose yg: F, > F 
is an F-homomorphism of F-fields. Then y extends to an F-homomorphism 
¢: F(z] > Fs[z], given by SS azz? BH SS p(a;)z". 


LEMMA 13’. Suppose yp: Fj — Fy is an F-isomorphism, and fy, = (fi). 
Let EF; be a splitting field of f; over F;, fori = 1,2. Then y extends to a 
natural F-homomorphism FE, > Ez. 


Proof. Induction on n = [E, : F,]. The idea of proof is to show that the 
method used in the proof of Theorem 11 works essentially the same way in 
all situations. Let g; be an irreducible factor of f;, and let a, be any root 
of g; in Ey; let gz = $(g1), and let az be any root of gz. The composition 
F,[2] 4 Fy[2] 3 F.[x]/(g2) is an onto homomorphism whose kernel is (g1), 
thereby yielding an isomorphism ¢: F,[z]/(g1) > F2[2]/(g2) that extends ¢. 


Now, by Proposition 4 we have a composition of isomorphisms 


Fy [ai] © Fila]/(n) & Bola\oo) © Fala. 


which we call gy. Certainly F; is a splitting field of f; over F;[a,], for 
t= 1,2, but [F, : F,[ai]] < n, so by induction gy, extends to the desired 
F-homomorphism FE, > E,. 0 


Now taking F, = F, = F in Theorem 13’, we have 


THEOREM 13". The splitting field of a given polynomial over a given field F 
is unique up to F-isomorphisin. 


Thus, we may speak of “the” splitting field. In Chapter 24 we shall 
need a more precise result concerning the number ng/K; 1 = "%z/K; L(Y) of 
F-homomorphisms & > EF extending any given F-homomorphism y: K > 
L, where F = Flay,...,@,,] is an F-field generated by various roots a1,..., Qu 
of a given polynomial f € F[z], and K is an F-subfield of FE. To compute 
this number we proceed “up the ladder” just as in the proof of Lemma 
13’. If f = a;2", we write f, for > p(a;)z", the polynomial naturally 
corresponding to f in L[z]. 


THEOREM 14. With notation as above, ng/x, 1 < [FE : K], equality hold- 
ing if E is a splitting field (of f ) and f,, has deg(f) distinct roots in L. 


Proof. Induction on n = [EF : K]. If n = 1, then EH = K, so the only 
F-homomorphism extending y must be g itself. 

In general, for n > 1 we take any root a of f in EF \ K. Any extension 
of y to an F-homomorphism o: # > £ can be viewed as an extension first 
to an F-homomorphism 7: K[a] > EL and then to o. 

Let g be the minimal polynomial of a over A’. We claim that nxjaj/K; 1 
is the number of distinct roots of g, in L. Indeed, any 7 : K[a] > L is given 
by n(3> aya") = 3 v(a;)n(a)'. so n is determined by (a), which is a root 
of gy by the argument of Proposition 6, i.e. 


On the other hand, for any root 6 € L of g, we have the substitution 
homomorphism K[z] > L extending 9, satisfying x 4 b, cf. Lemma 18.3; 
the kernel contains (g) and thus we have 


Ka] = K[2]/(g) > LE 


by Noether I, satisfying a +> 6. In this way each root of gy gives rise to a 
homomorphism K[a] > L extending y, proving the claim. Thus 


NKal/K; L < deg gy, = deg g = [K [a]: K}, 


equality holding if g, has deg(g) distinct roots in L. 

By induction, the number of F-homomorphisms FE > L extending any 
n: K[a] > L is at most [EF : K[a]], equality holding if f, has deg(f) distinct 
roots in LE. We conclude 


NB/K; LS MKal/K; LM B/K[q S [K[a]: KE: Kal] =[E£: K], 


equality holding if f, has deg(f) distinct roots in L (since g, divides f,,). 0 


Example 15. (i) The splitting field of f = «* — 2 over Qis FE = Q] V2.7]. 
by Example 10 (vi). Another way of constructing the splitting field is first 
to adjoin 2 and then observe that f factors over Q[W2] as 


(2? — V2)(0? + V2) = (a — VI(x+ V2)(0? + V9). 


Thus, (Q[ W2])[z]/(z? + V2) is another splitting field of f and, of course, is 
isomorphic to Q)W2,i V2] = Q| V2.7] = E. 

(ii) The polynomial f = 24 +4 has roots (£147). Thus its splitting 
field is Q[z], whose dimension over Q is only 2, although deg f = 4 and f 
has four distinct roots. 


Note that the condition in Theorem 14 needed to obtain equality has 
two parts: f, splits in L, and its roots in L are all distinct. This leads us 
back to question (3) (posed after Example 10). 


Separable Polynomials and Separable Extensions 

We say a polynomial f € F[z] is separable if its roots (in a splitting field of 
f over F) are all distinct, i.e., the number of distinct roots is deg f. We say 
f is inseparable if f is not separable, t.¢e., f has a repeated root. To avoid 
a vacuous situation, we shall always assume that deg f > 1. Separable 
polynomials tie in to Theorem 14, foreshadowing the theory of Chapter 24: 


COROLLARY 16. Suppose F is the splitting field of a separable polynomial 
f over F. Then the number of F-isomorphisms from F to itself is precisely 


[Ef : F). 
Proof. Just take y: F > E to be the inclusion map. 0 


This result is so critical to the theory that we shall concentrate exclu- 
sively on separable polynomials for the rest of this course. The study of 


separable polynomials involves an algebraic approach to calculus. Given a 
polynomial f = 37 a;x', we formally define its derivative f! to be Sy iajx""!: 
it is a simple matter to check that the usual rules of derivative hold, for all 
polynomials f, g: 


(f+) =f tg’: (1) 
(af)'=af' for ain FP; 
(fo) = fot fa. 


PROPOSITION 17. Let EF be a splitting field of f. The polynomial f is 
separable iff f and f' are relatively prime in E[z]. 


Proof. We prove, equivalently, that f has a repeated root iff f, f’ are not 
relatively prime. 
(=) On the contrary suppose (x — a)?|f. Then f = (2 — a)?g so 


f= («- a)'(a— a)g + (x — a) ((a — a)g)’, 


which obviously is divisible by x — a. 

(<=) Suppose f and f’ are not relatively prime in E[z]. Then some 
irreducible factor 2 — a of f also divides f'. Write f = (2 — a)g; we know 
x —a divides f! =g+(x—a)q', so x — a divides g, implying (« — a)?|f.0 

It would be a shame to have to pass to E[z] when applying this crite- 


rion; fortunately, we have the following fact. 


PROPOSITION 18. Suppose F C K are fields, and f,g € Fla]. Then 
ecd(f,g) is the same in F [x] and in K[z]. 


Proof. Let dp,dx be the respective gcd in F[z] and K[z]. A fortiori, 
dp divides f and g in K[s], so dp|dx. On the other hand, (dr) = (f)+ (g) 
in F[z], so there are hy, hy in F[z] such that dp = hy f+hog; but dx divides 
the right-hand side, and thus dgx|dz. 0 


CorRoLiaryY 19. A polynomial f in F[z] is separable iff f and f' are rela- 
tively prime in Fz]. 
COROLLARY 20. An irreducible polynomial f is separable iff f' 4 0. 


Proof. f,f' are not relatively prime, iff gcd(f, f’) = f (since f is irre- 
ducible), iff f|f', iff f' = 0 (since deg f' < f). 


How can f' = 0? From calculus (where F C R) we learned this implies 
f is constant, which is ruled out since we are assuming deg f > 1. Thus, we 


conclude for F C R that every irreducible polynomial over F is separable. 
Soon we shall improve this result, but first let us record a useful fact. 


Remark 21. Suppose f = gig in F[z]. Then f is separable iff g; and go 
are both separable and relatively prime. (To see (=), let h be the gcd of 
gi and g.. Then h?|f, implying any root of h is a multiple root of f.) 


The Characteristic of a Field 

In order to study fields that are not necessarily subfields of R or C, we define 
m-1 ina field F to be 141+4+---+1, taken m times. Note (m-1)(n-1) = mn-1; 
the formal proof is an easy exercise in induction. 


Definition 22. The characteristic of a field F, denoted char(F), is the order 
of the element 1 in (F,+), 7.e., char(F) = m if m-1 = 0, for m > 0 minimal 
such. If m- 140 for all m > 0, we say char(F) = 0. 


Example 28. char(C) = 0 = char(R) = char(Q). On the other hand, 
char(F’) > 0 for any finite field F (since (F,+) is a finite group), and, in 
particular, char(Z,,) = p. 

Along another line, a field F has characteristic # 2 iff 1+ 1 4 0, iff 
2 is invertible in F, 7.e., > € F. The existence of > often is very useful, for 
example, in Example 21.12 (iv). 


Remark 24. Suppose p = char(F) > 0. 

(i) p is a prime number. (Otherwise let p = ab for 1 < a,b <p. Then 
0 = (p-1) = (a-1)(6- 1) soa-1 =0 or b-1 =0, contrary to the minimality 
of p.) 

(ii) pa = (p- 1)a = 0a = 0, for all a in F. 

We can view the characteristic more structurally. For any field F there 
is a ring homomorphism yg: Z > F given by y(m) = m.-1. 

Case I. y is an injection. Then char(F’) = 0. Note that F contains a 
copy of the field of fractions of Z, which is Q. 

Case II. ¢ is not an injection. Since y(Z) C F is an integral domain, 
we see that ker y is a nonzero prime ideal of Z, so ker y = pZ for some 
prime number p, implying char(F) = p. (This gives a “structural” proof 
that char(F) is prime.) Also, Noether I yields an injection ¢:Z/pZ— F. 


Remark 25. Summarizing the two cases above, we have 
(i) char F = 0 iff F contains an isomorphic copy of Q. 
(ii) char(F) = p > 0 iff F contains an isomorphic copy of Zy. 


Let us return to separability. 


Definition 26. A separable element over F is an element whose minimal 
polynomial over F is separable. A separable extension of F is an F-field 


all of whose elements are separable over F. In this case, we also say the 
extension K/F is separable. 


F is perfect if every finite extension is separable. 


Example 27. Qis a perfect field; more generally, every field of characteristic 
0 is perfect. (Indeed, in characteristic 0 the derivative 7 ia;z’~1 of a non- 
constant polynomial f = S>a;x* cannot be 0, implying every irreducible 
polynomial is separable, by Corollary 20.) 


We shall see in Chapter 25 that every field of finite order also is perfect, 
despite the existence of nonconstant polynomials with 0 derivative (such as 
x? +1 in Z,[z]; but also note here the factorization 2? + 1 = (2 + 1)?, so 
Corollary 20 is not applicable). 

Although perfect fields are very useful, our definition is unsatisfying 
since it depends on verifications for many polynomials; a more esthetic 
criterion is given in Exercise 16. 


Exercises 


1. C and Q[V2] each can be viewed as an algebra over itself in two 
different ways. (Hint: Either by the identity or by means of the 
homomorphism of example 5.) 

2. Notation as in Theorem 14, show ngyx, y < [EF : K] if o(f) has 
fewer than deg(f) distinct roots in L. 

3. (The Primitive Root Theorem.) If K is a separable, finite extension 
of an infinite field F, then K = Fla] for some a in K. (Hint: By 
iteration, it is enough to show that, whenever K = F[a,6], there is 
cin K for which K = F[c]. Let f,g € F[2] be the respective minimal 
polynomials of a,b. Passing to a splitting field E of fg, take the 
roots @ = Q1,...,@m and b = hy,...,b, of f and g (respectively), 
and note that at+ab 4¢a;+ ab; for all 7 #1 and all z, for all but a 
finite number of ain F. Prove Fla,b] = F[a+ ab] for any such a, as 
follows: Let c= a+ab and L = F[c]. Define h(x) = f(c—az) € L[z]. 
Then h(b) = f(a) = 0, and h(b;) # 0 for all 7 > 1, implying e-b= 
ecd(g(z),h(z)), which can be taken in L[z], implying 6 € L. We 
shall improve this result in Chapter 26.) 

4. (/2+ V3) has degree 4 over Q. (Hint: As in Exercise 3.) 

5. Suppose K = F{a] is a finite field extension of F, and F C LC K. If 
g is the minimal monic polynomial of a over L, then L is generated 
by the coefficients of g. (Hint: let L, be the subfield of LE generated 
by the coefficients of g. Then g is also the minimal polynomial of a 
over L,, implying [K : L,] = deg g =|[K : L], so L = £4.) 


10. 


Ei 


iz 


13. 


. Suppose Fla]/F is finite. Then there are only finitely many fields 


between F and Fla]. (Hint: let f be the minimal polynomial of a 
over F, and take F C L C Fla]. By Exercise 5, L is generated by 
the coefficients of the minimal polynomial of a over LE, and f has 
only a finite number of monic factors.) 


. (Steinitz’ Theorem.) Suppose F is an infinite field, and K is an 


F-field. Then K = Fla] for some a, iff there are only finitely many 
fields between F and K. (Hint: (<) As before, one needs only show 
that F[a,b] = F[a+ab] for some ain F. But Fla+a,b] = Fla+ag)] 
for suitable a, 4 a» in F; this field contains a+ a,b — (a+ ab) = 
(a1 — ag)b.) 


. If K/F is separable and finite, then there are only finitely many 


fields E between F and K, by Exercises 3 and 7. This basic result 
will be reproved in Chapter 26 using the Galois correspondence. 


The Roots of a Polynomial in Terms of Its Coefficients 


. Let s1,....8, be the elementary symmetric polynomials in the in- 


determinates 21,...,%», cf. Exercise 20.11. Suppose a1,...,a, are 
the roots of a monic polynomial f = 07, ajz' € Z[z] in a suitable 
splitting field. Then s;(a1,....@n) = (—1)'an_;, for 1 <i <n. In 
particular, the sum of the roots is —a,, and the product of the roots 
is (—1)"ap. 

Suppose f € Z[z] has leading coefficient m, and ay,...,@, are the 
roots of f in a suitable splitting field. If h(a1,...,2,) is an arbitrary 
symmetric polynomial, then h(may1,...,ma,,) € Z. (Hint: By Exer- 
cise 21.13, may,,..., ma, are the roots of suitable monic polynomial 
of degree n over Z[x], so each s;(mayz,...,May) € Z.) 


Separability and the Characteristic 

Give an example of an infinite field of positive characteristic. (Hint: 
Same idea as Exercise 21.9.) 

If an irreducible polynomial f of F[z] is inseparable, then char( F) = 
p > 0, and f has the form S> a,x"; in other words, f has the 
form g(x”). Conversely, if char(F) = p and f = g(a?), then f is 
inseparable. (Hint: f’ = 0.) 

Suppose K = Fla], and f is the minimal polynomial of a. Suppose 
char(F) =p > 0. Then f is separable iff F[a?] = K. (Hint: If f is 
inseparable, then write f = g(x”) and note a? is a root of g(x), 
implying 


Pl: F] _degf 


Peele eels Fl aeeg 


14. 


15. 


16. 


Lt. 


18. 


19. 


Conversely, if f is separable, then a is a common root of its minimal 
polynomial g over F[a?] and of x? — a? = (x — a)?, implying z — a 
is their gcd, and thus a € F[a?].) 

Suppose char(F) = p > 0, anda € F. Let f = 2? — a. Either f is 
irreducible, or else a is a pth power in F, in which case x? — a is 
also a pth power in F[z]. (Hint: Suppose that f = gh, and take a 
root bin a splitting field F of g. Then g = (x — b)* for some k < p: 
checking the coefficient of 2*~! shows b € F.) 

Give an example of a field extension K/F that is not separable. 
(Hint: F must be infinite, of positive characteristic.) 

A field F of characteristic p > 0 is perfect iff every element of F is 
a pth power in F. 


Calculus through the Looking Glass 

Define a derivation of afield F to be amap D: F > F that satisfies 
equations (1) of the text. Let Fy = {a € F : D(a) = 0}. Show that 
D(XD, aia") = YO, ia;a’~1 D(a) for all a; in Fy, a in F. In other 
words, D(f(a)) = f'(a)D(a), for any f in Fo[z]. Conclude Fo is a 
field; if a € F is separable over Fo, then a € Fo. In particular, any 
derivation on a field of characteristic 0 takes on the value 0 on every 
element algebraic over Q. 

Generalizing Exercise 17, given a field extension K/F, suppose D 
is a derivation of F. If a € K is separable over F, then D has 
a unique extension to Fla]. (Hint: D(3o, aia’) = 3; D(asjat + 
>>, iaja*! D(a). In particular, if f is the minimal polynomial of a 
then 0 = D(0) = D(f(a)) = g(a) + f'(a)D(a), where g is obtained 
by applying D to each coefficient of f. This has a unique solution 
for D(a).) 

As in Exercise 18, except now suppose the minimal polynomial f 
is inseparable. Then taking f to be monic, show that D cannot be 
extended to K unless D sends every coefficient of f to 0, in which 
case D(a) can be taken to be any arbitrary element of K. 


CHAPTER 24. THE GALOIS CORRESPONDENCE 


One of the most beautiful achievements of mathematics is Galois’s theory, 
linking field extensions to groups and thereby enabling one to apply group 
theory to study field extensions. Although this theory can fill a course by 
itself, we shall focus on the main results, in the following three chapters. 


The Galois Group of Automorphisms of a Field Extension 


Definition 1. An automorphism of a field K is an isomorphism K > K. If 
K is an F-field, an automorphism of K/F is an F-automorphism of K, 
i.e., an automorphism that fixes F elementwise. Gal(K/F) is the set of 
automorphisms of K/F and is called the Galois group of K over F. 


Remark 2. Gal(K/F) is indeed a group, whose group operation is given by 
composition of functions. 


This observation revolutionized mathematics, by enabling one to use 
group theory to study fields. But first some basic remarks. 


LEMMA 3. Suppose K/F is a finite extension. Then any homomorphism 
o:K > K satisfying o(F) = F is an automorphism. 


Proof. ¢ is an injection, by Remark 23.1. On the other hand, taking a base 
B of K over F, we see that o(B) is a base of o(K) over o(F) = F, so 
[o(K): F] =[K: F]; thus o(K) = K, implying o is onto. 0 


Remark 4. If F C EC K, then any automorphism of K/L is an automor- 
phism of K/F; hence Gal( K/L) C Gal(K/F). 


Remark 5. Suppose a € K, and let a, =a, a2,...,a¢ be the roots (in K) 
of a polynomial f € F[z]. By Proposition 23.6, any o in Gal(K/F) satisfies 
o(a) = a;, for suitable 7. In particular, if K = Fla], then there is some 
permutation 1, of {1,...,¢} such that o(a;) = a,,;; for each 7. In this 
way 7 +> 7, describes a group injection Gal(K/F) > 5}. 


On the other hand, Proposition 23.4 shows that if K = Fla], then for 
each root a; of f there is an automorphism o of K/F given by a + aj: 
thus in this case | Gal(K/F)| equals the number of roots of f in K. Before 
continuing this analysis, let us consider some examples. 


Example 6. 

(1) F = Qand K = Q[V3]. The minimal polynomial of 3 is x? — 3, 
whose roots are +\/3: hence, any automorphism o of K/F is given 
by o(V/3) = £V3, and thus Gal(K/F) & S:. The same argument 
shows more generally that Gal(K/F) & 8, whenever K = F[,/al, 
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where a € F is not a square in F, provided char(F) 4 2 (to en- 
sure that fa and —,/a are distinct). But, when char(F) # 2, 
any quadratic extension has this form, by Example 21.12(iv). The 
characteristic 2 case is handled in exercise 9. 

(2) F=Qand K = Q| V2]. Since the only other root of 2 — 2 in K is 
— V2, we see Gal(K/F) & So. 

(3) F = Qi] and K = Qji, V2]. Now there are automorphisms sending 
V2 to i” ¥2, for m = 0,1,2,3, so | Gal(K/F)| = 4, and, indeed, 
Gal(K/F) = (c), where a is given by W264 iV. 

(4) F = Q, K = Qi, Y2]. Any automorphism in Gal(K/F) sends 
V2 6 im V2 and iH +i, so there are at most 4-2 = 8 possible 
automorphisms. On the other hand, Gal(K/F) contains both the 
automorphisms o (given in (3)) and 7 given by complex conjugation; 
furthermore, 


tot 1: Y2H -iW?2 = 03( V2), 


TOT ‘ite i=oar(t), 


from which we see ra7~! = o3. Thus the generators o, 7 of Gal(K/F) 
satisfy the relations of the dihedral group D4, and we conclude 
Gal(K/F) = Dg. 

Note: Once one observes that Gal( A/F) has a cyclic subgroup 
(c) of order 4, and that or 4 T0, it is obvious that Gal( K/F) = D4. 
Such short cuts are standard fare in the theory. 


In all of these examples we note |Gal(K/F)| < [K : F], with equality 
often holding. Splitting fields play a key role. 


THEOREM 7. Suppose F is the splitting field of a polynomial f over F. 
Then |Gal(#/F)| < [EF : Fl, equality holding if f is separable. 


Proof. In view of Lemma 3, this is just a restatement of Theorem 23.14. 0 


In order to extract as much as possible from this result we need a 
related notion. Suppose S'is a set of automorphisms of A’. Define the fixed 
subfield K* to be {a € K : o(a) =a for all o in S}. 


Remark 8. K* is a subfield of K, for any set of automorphisms S' of K. 
(Indeed, if o(a) = a and o(b) = b, then o(a +b) = o(a) £a(b) =atb, 
o(ab) = o(a)o(b) = ab, and (for a4 0) o(a~*) =a(a)>t =a"".) 

Remark 9. Let G = Gal(K/F). Then K@ D F, and G = Gal(K/K®). 
(Indeed, G fixes F by definition, so F C K°. Now Gal(K/K®) C G by 


Remark 4. On the other hand, any o in G fixes K@ by definition, so lies 
in Gal(K/K®).) 

We already can make a striking observation. 
PROPOSITION 10. If F is the splitting field of a separable polynomial 
over F, then E¢ = F where G = Gal(E/F). 
Proof. [E : F] = |G| by Theorem 7. But taking F, = EB? we have G = 
Gal(#/F,) by Remark 9, so also [EF : Fi] = |G|. But F C F, C E so we 


conclude F, = F since [F, : F] = ieee =1.0 


To utilize this result we need a way of producing a polynomial f € F[z], 
given its roots a,,...,a; in K. Here is a converse to Remark 5 which 
guarantees that the obvious candidate (x — a,)...(% — a4) € F[z]. 


Remark 11. Suppose G is a set of automorphisms of K’, and 


f=(e—4)...(a—a,) € K[z]. 


Also suppose {a1,...,a1} are distinct, and each o in G acts as a permutation 
on {a1,...,a:}. (In other words, suppose for each o in G that there is a 
suitable permutation m, of {1,...,t} for which o(a,) = a,,., for each 


1<u<t.) Then f € K©[z]. (Indeed, extending o to an automorphism of 
K[2z] via o(a) = x, we have 


o(f) =o(a—a1)...0(@ — ay) = (2 — o(a1))... (2 — o(ar)) = f, 


proving that o fixes each coefficient of f, as desired.) 


So far, we have obtained nice results when F is the splitting field of a 
separable polynomial f over F. However, we often are presented with the 
finite field extension H/F without the polynomial f, and thereby leading 
us to inquire whether this situation really depends on the polynomial. To 
this end we add two definitions to definition 23.26. 


Definition 12. E/F is normal if for each element a of # the minimal poly- 
nomial of a (over F) splits (into linear factors over E). 
E/F is a Galois extension if E/F is both normal and separable. 


Note 13. If E/F is Galois, then E/L is Galois, for any fields F C LC E. 


Although the definitions of separable and normal extensions are for- 
mally rather strong, holding for every irreducible polynomial with a root in 
F rather than merely for one particular polynomial, they are justified by 
the following important result. 


THEOREM 14. The following conditions are equivalent, for a finite field 
extension Ei/F: 

(i) F is the splitting field of a separable polynomial f over F; 

(ii) E? = F for a suitable group G of automorphisms of E; 

(iii) E/F is Galois. 


Proof. (i) > (ii) by Proposition 10. 

(ii) => (iii) For any a, in FE, we must show that the minimal polynomial 
f of a, is separable and splits over EB. Let S = {a,,...,a+} be the set of 
distinct roots of f in &, and let 


g=(—a1)...(@— a4) € Elz]. 


By inspection g is separable and splits, and g|f. But Remark 11 implies 
9g € Flax] so f = g (since f is irreducible in F[:]). 

(ii) => (4) Write H = Flay,...,a4], let f; be the minimal (monic) 
polynomial of a; over F, for each 7, and let f be the product of the f;, 
discarding duplications. Obviously f splits and is separable by Remark 
23.21, since the f; are relatively prime. O 


The really surprising part of Theorem 14 is that condition (ii) is enough 
to guarantee that #/F is Galois, and we shall use this criterion repeatedly. 


Coro.iary 15. E/F is Galois iff |Gal(E/F)| =[E: FI. 


Proof. (€) Let G = Gal(E/F) and F' = E® D F, cf. Remark 9. Then 
[fF : F) =[E: F'], implying F’ = F. Thus E/F is Galois, by Theorem 
14.0 


We also want to show that if Theorem 14(ii) holds, then G = Gal( #/F). 
This requires the following fact. 


LEMMA 16. (E. Artin’s Lemma.) Suppose G is any finite group of auto- 
morphisms of a field E. Then [E : E°] < |G|. (In particular, [EZ : E°] is 
finite.) 


Proof. Write G = {a1 = 1,02,...,0n}. We must show that any m > n 


elements {a1,...,@m} of E are linearly dependent over E®. Indeed, by 
linear algebra, there is a nontrivial solution ;,...,5,, (in #) to the n 
equations 
S\oi(aj)bj =0, L<ign. (1) 
j=l 
Take such 6;,....5m with the smallest possible number of nonzero bj: re- 


ordering the b; if necessary, we may assume b; 4 0; multiplying through 


by bee we may assume 6; = 1. We shall show that each 6; € EG, thereby 
yielding the desired dependence (taking 2 = 1). It is enough to show that 
each 6; is fixed by every o in G. But applying o to (1) yields 


So coi(a;)o(b)) =0, 1<ign: 
g=l1 


since also G = {a01,...,70n} one sees that o(b1),...,7(bm) is a solution 
of (1), and thus o(b,) — 61,...,0(6m) — bm is also a solution of (1). But 
o(b;) — b} =1—1=0, so by hypothesis this solution must be trivial, i.e., 
each o(b;) — 6; =0, as desired. 0 


Proposition 17. If F = B®, then G = Gal(E/F). 


Proof. E/F is Galois by Theorem 14, so | Gal(#/F)| = [EF : F] < |G]; since 
G is a subgroup of Gal(#/F), we conclude that Gal(#/F) =G.O0 


CorRoLiary 18. If G is any subgroup of Gal(E/F), then the extension 
E/E® is Galois, with Galois group G. 


Proof. Apply Proposition 17 to Theorem 14(ii). 0 
The Galois Group of a Polynomial 


Summary 19. Before continuing the theory, let us review the basic setup, 
which utilizes various fundamental notions of abstract algebra. Suppose 
that K = Fla,,...,a+] is a finite field extension of F. For convenience we 
assume F is perfect, which is the case when char(F) = 0 or when F is a 
finite field. Take f to be the product of the minimal monic polynomials of 
the a; (discarding duplication), and let EF be the splitting field of f over F. 
Then E/F is Galois, having some Galois group G of order [EZ : F]. G is 
also called the Galois group of the polynomial f, and is of great import in 
studying polynomials and their roots. Learning the group structure of G 
thereby becomes one of the main objectives of field theory. Some remarks 
along these lines, where n = [K : F] and k = deg f: 


1. n divides |G|, by Remark 23.12; 

2. |G| < n! by Theorem 23.11; 

3. Recall by Proposition 23.6 that any automorphism of E'/F permutes 
the roots of f; since the number of roots of f in FE is k, we get a 
natural group injection G > S;,, and, in particular, |G| divides k! 
Note when f is irreducible that k = n, in which case we conclude 


that |G] divides n! 


The Galois Group and Intermediate Fields 

Let us fix a Galois extension H/F, and let G = Gal(#/F). An F- 
subfield Z of & is called an intermediate field, i.c., F C L C BE. In Exercise 
22.7 we saw that a number a of degree 4 is constructible iff Q[a]/Q has 
an intermediate field (other than Q[a] and Q). In general, the location of 
intermediate fields is of utmost concern, and our object here is to find a 
sublime connection between intermediate fields and subgroups of G. 

Any intermediate field LE gives rise to the subgroup Gal(£/L) of G, 
cf. Remark 4. Conversely for any subgroup H < G; clearly, HE” is an 
intermediate field. Thus we have a way of passing back and forth between 
the subgroups of G and the intermediate fields. 


THEOREM 20. (Fundamental Theorem of Galois Theory) 
(i) Suppose E/F is a Galois extension, and G = Gal(E/F). Then 
there is a 1:1 correspondence 


{subgroups of G} © {Intermediate fields (between F and E)} 
given by Hp 
Gal(#/L) < L. 


In particular, H, = H, iff B= EP. 
(ii) This correspondence satisfies the following extra properties, where 
A<G: 
(l(a) S(2on4 |, (Gx nS b* Fi 
(2) It is order-reversing, i.e., H, C Hy 6 E™ 5 BE: 
(3) H«G iff EF" /F is a normal field extension, in which case we have 
Gal(E" /F) = G/H. 
Proof. We shall show that the composite of these correspondences in each 
direction is the identity. E/E” is Galois, and Gal(E/E”) = H, by Corol- 
lary 18. In the other direction, for any intermediate field L we have F/I 
Galois by Note 13, so EG*(#/) = L by Theorem 14. 
We turn to the additional assertions. 
(1) |H| = [E : E”] by Theorem 7 applied to the Galois extension 
E/E#®: hence 


_(Gl_ [B:F 
[al [er Be] 


(2) Immediate, from the definition of E? and (1). 
(3) We start with a general observation for any 0,7 € G and a € E: 


[IG : H] = [fF : Fi). 


tT(a)=a_ iff ota '(a(a)) = o(a). 


Now take any subgroup H < G, and put L = EE”. Then 
a€L iff r(a)=aforall rin H, iff o(a)e€ pene 
In other words, for any o in Gal(#/F), we have 
E°Ho™ — o(L). 


Now suppose that L/F is normal. If a € L, then, for each o in G, o(a) 
is a root of the minimal polynomial of a, implying o(a) € L; thus o(L) = L. 
Hence 
EeHe™ — o(L)=L= EF, 


so gHa~! = H, by (i). This holds for all ¢ in G, proving H 4G. 
Conversely, suppose H <G. Then o(L) = Bote — BH = LF for all 
a in G, so restriction of each automorphism to LE yields a map 


® : Gal( E/F) > Gal(L/F). 


Clearly, L°(@ C ES = F, so L/F is Galois by Theorem 14, and ®(G) = 
Gal(L/F), by Proposition 17, 1.€., 
® is onto. Furthermore, ker® = {9 € G: ol, = 1|z} = Gal(FZ/Z), 
implying 


Gal(L/F) = Gal(E/F)/ker ® = Gal(#/F)/Gal(#/L). 


O 


We shall call this correspondence the Galois correspondence, and the 
remainder of this text involves its many applications to algebra. 


Exercises 


The Galois Group of the Compositum 
The next few exercises involve the field compositum of subfields, ef. 
Proposition 21.18. 
1. Suppose K, LE are two subfields of F, with K/K ML Galois having 
Galois group G. Then KL/E also is Galois with Galois group G. 
(Hint: KL is the splitting field over L of the same separable poly- 
nomial as K over KL. Let H = Gal(K L/L). Then there is an 
injection gy: H > G, given by restriction to K, and K®) = KL, 
proving y(H)=G.) 
2. Notation as in Exercise 1, show that [KL : L] divides [K : KN L). 


. Suppose that K,,K, are subfields of #, and let F = Ki, Ko. 
If K;/F are Galois for 2 = 1,2, then K, K>/F also is Galois, and 
Gal(K, K2/F) = Gal( K, K,/K,)xGal( kK, K./K>). (Hint: Write K; 
as the splitting field of the separable polynomial f; over F, and thus 
KK also is the splitting field of a suitable separable polynomial. 
Let G = Gal( kK, K2/F) and H; = Gal( A, K2/K;) <G. Note that 
FH, = Gal( Kk, K2/K,K2) = {e}, and, by Exercise 1, ||| H2| = 
[K, Ky: Ky][ky : F] = |G]; conclude with Proposition 6.13.) 

. What is the Galois group over Q of the splitting field of the poly- 
nomial + — 52? + 6? Of the polynomial x° — 2? 

. If char(F) = 0 and & = Fp], where p is a primitive nth root of 1 
then Gal(#/F) is a subgroup of Euler(n). (Hint: F = FQ[p].) 


More on Artin’s Lemma 
. E/E is Galois and finite, for any finite group of automorphisms 
G of an arbitrary field E. 
. (Dedekind Independence Theorem. ) Given any monoid M define a 
(linear) character of M into a field F to be a monoid homomorphism 
x: M > F\ {0}, @e., y(ab) = y(a)x(b). Prove that any distinct 
characters x1,....Xn are independent over F, in the sense that if 
Q1,...,Qy, satisfy S> a;x;(u) = 0 for all win M then each a; = 0. 
(Hint: This the same idea as in Artin’s Lemma.) 
. Any distinct set of homomorphisms of fields K — EF are indepen- 
dent over F in the sense of Exercise 7. 
. (Quadratic extensions in characteristic 2.) Assume char(F) = 2, 
and K is a separable quadratic extension of F. Then K = F[a], 
for suitable @ whose minimal polynomial has the form #? + 7+ a. 
(Hint: Replace x by Gx for suitable 8.) Observe that a+ 1 is also 
aroot of f, so K is Galois over F, with Galois group generated by 
the automorphism given by a> a+ 1. (This result is generalized 
to Galois extensions of arbitrary prime degree, in exercise 23.9.) 
In case K is not separable, then K = Fla] where the minimal 
polynomial has the form «?-+a. In this case, show that every element 
of K \ F is inseparable. 


CHAPTER 25. FINITE FIELDS 


Our first application of the Galois theory is to fields of finite order. Since 
the beginning of computer technology the field Z. has had important ap- 
plications, with “1” denoting a closed circuit and “0” denoting an open 
circuit. Since computers use more than one circuit, a natural and impor- 
tant question is whether there are fields whose orders are higher powers 
of 2, or of other orders. So far we know Z, is a field for p prime, so, in 
particular, there are fields of order 2,3,5,7, and so forth. What about 4 and 
6? More generally, we want to tackle the following two questions, given n 
inN: 

(1) Is there a field of n elements? 

(2) Are there two nonisomorphic fields of n elements? 


The study of finite fields is an excellent example of reasoning from the 
converse. Instead of trying to construct new fields at the outset, we shall 
presume that we have found a finite field F and determine several of its 
properties. Having determined the basic properties, we shall be able to 
construct all finite fields and determine their structure. First let us study 
a very significant subfield. 

The characteristic subfield Fy of F is defined to be the subfield gen- 
erated by 1; by Remark 23.25, Fo = Q iff char(F) = 0, and Fo = Z, iff 
char( FP) = p. Now F D Fy is a field extension, and we can operate our ma- 
chinery of the previous chapters. Suppose |F| =n < oo. Then char(F) = p 
for some prime number 0 < p < n, and Fy & Z,. Let t =[F : Fo] < n. 
Then F has a base {b,...,6;} over Fo, so each element of F can be written 
uniquely in the form ea a,b; for a; in Fy. 

Thus, an arbitrary element of F can be described uniquely by means 
of the vector (a1,...,a4), where each a; € Fo. But Fo has p elements, and 
there are t choices to be made, yielding p* possibilities for (a1,...,a4). This 
proves n =p’. 

PRoposITION 1. If F is a finite field then |F| = p' where p = char(F). 
Conversely if f € Z,|#] is irreducible of degree t, then Z,|2]/{f) is a field 
of order p". 


Proof. We just proved the first assertion; the second is clear, since taking 
F =Z,|2]/(f), we have [F : Z,] = t, so |F| =p‘ as just shown. 0 


Example 2. (i) There is no field of order 6, since 6 is not a prime power. 
(ii) To find a field of order 4 we merely need to find an irreducible 
polynomial f of degree 2 over Z 2; we may assume that f is monic, so 
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f=2?+a,r2+4+ a. Note 0,1 cannot be roots of f, so 


O04 f(0)=ao implying ap =1; 
OA f(=1l+tatao=1lt+a,+1=a,, implying a; =1. 


Thus, f = 27 + 2 +1 is irreducible, and Z.[z]/(f) is the desired field. 


We might think that we are ready to answer question 1. Indeed, by 
Proposition 1, to obtain a field of arbitrary prime power order p*, one merely 
need find an irreducible polynomial of degree ¢ over Z,. This is not as easy 
one might expect, since our more sophisticated techniques (e.g., Gauss’s 
lemma and Eisenstein’s criterion) only work in characteristic 0. There 
exist direct combinatorial proofs of the existence of irreducible polynomials 
of any degree over Z,, for p arbitrary, but these tend to be rather intricate. 
(One reasonable approach is via Exercises 3 and 4.) So let us turn first 
to question 2. There is a nice answer using splitting fields. This will not 
only provide us with the means of solving questions 2 and 1, but eventually 
will provide us with a precise determination of the number of irreducible 
polynomials of degree ¢t over Z,, cf. Exercise 3. 


Remark 3. If F is a field of order n, then G = (F \ {O},-) is a group of 
order n — 1, so any clement of G satisfies 2°~1 = 1, and thus x" = z. But 
0” = 0, so we see that every element of F satisfies the polynomial x” — zx. It 
follows at once that F is the splitting field of «” — x over the characteristic 


subfield of F. We have proved: 

PROPOSITION 4. Any field of order n = p' is the splitting field of the 
polynomial 2" — x over Zp. 

COROLLARY 5. Any two fields of the same order are isomorphic (being 
splitting fields of the same polynomial over the same base field). 


Proposition 4 also gives us the clue for attacking (1); let us take the 
splitting field of 2” — 2 over Z, and prove that it has order n. First we need 
an easy observation. 


PROPOSITION 6. Suppose R is any integral domain with char(R) = p. Then 
there is an injection y: R > R given by aw a?. 


Proof. First we prove y is a homomorphism. Clearly, y(ab) = (ab)? = 
a?b? = g(a)p(b), and 


plat b) = (a+b)? =a? + (iar (part kw = oh te 


= pla) + p(d). 


Finally, g(—a) = (—a)? = (—1)?y(a) = —a. (This is clear if p is odd; on 
the other hand, —1 = +1 if p = 2.) 
But ¢ is an injection, since kery = 0.0 


This homomorphism gy, called the Frobenius map, is most useful. Note 
that y fixes Zp, by Theorem 2.15. 


COROLLARY 7. With notation as in Proposition 6, for any t there is an 
injection 4) = ¢': R > R given by aw a?’ 


PROPOSITION 8. Suppose F is a field of order n = p*. Then {elements of 
F satisfying the equation z” = x} is a subfield whose order is precisely the 
number of roots in F of the polynomial 7” — x. 


Proof. {a € F : a” =a} is just the fixed subfield of F under + = y!. The 
last assertion is obvious. O 


THEOREM 9. There exists a field of n = pt elements, unique up to isomor- 
phism, for any prime number p and any t # 0 in N. Furthermore, this field 
satisfies a” = a for each element a. 


Proof. Let FE be the splitting field of the polynomial f = 2” —z over Z,. By 
Proposition 8, Hy = {a € EB: a” =a} is a subfield that consists of the roots 
of f, so Ky = E by definition. It remains to show the roots of f are distinct 
(so that there are n of them.) But f! =na"~!—1 = —1 (since p divides n), 
so f’ is relatively prime to f, implying f is separable by Corollary 23.19. 
Uniqueness is by Corollary 5.0 


The field of n elements is called GF(n). Note that although we put 
aside the task of finding irreducible polynomials over Z,, we now know that 
they exist, and also have some help in finding them. 


Remark 10. Taking n = pt, we see that K = GF(n) satisfies the following 
properties: 

(1) K = {0,1,a,a7,...,a"-*} for some a € K. (Indeed, K \ {0} is a 
finite multiplicative subgroup of the field K, and thus is cyclic, by 
Theorem 18.7.) In particular, K = Fla] for any field F C K, since 
all of the elements of K have been expressed algebraically in terms 
of a. (Compare with Exercise 23.3.) 

(2) Taking a as in (1), let g be the minimal polynomial of a over Z,. 
Clearly, K = Z,|a] © Z,/(g), so deg g = [K : Z,] =t, implying g is 
an irreducible polynomial of degree ¢ over Z,. 

(3) Notation as in (2), g divides 2” — x, by Proposition 4, and thus is 
separable. 


(4) The same reasoning shows that any irreducible polynomial f over K 
divides 2™ — 2, where m = |K|*8F. 


Let us now compute of the Galois group of an arbitrary finite extension 
K/F of a finite field FP. Of course, K is also a finite field, and K and F 
have the same characteristic, so K = GF(p') and F = GF(p*) for a suitable 
prime number p and suitable s, ¢. 

First assume F = GF(p) = Z,. By Proposition 24.4, K is the splitting 
field of the separable polynomial f = uP’ — a, over Z», and so K/Z, is Galois. 
Let us compute G = Gal(A/Z,). First of all, |G| =[K : Z,] =t. On the 
other hand, the Frobenius map 0: K > K (given by a a?) fixes Z, and 
thus is an automorphism of order t, so G = (¢). 


THEOREM 11. Any finite extension K/F of finite fields is Galois; further- 
more, Gal(K/F) is cyclic, generated by a suitable power of the Frobenius 
automorphism. 


Proof. We proved this for F = Z,. In general, Z, C F C K, so Gal( A/F) 
is a subgroup of the cyclic group Gal(K/Z,) and thus is cyclic. 0 


COROLLARY 11. Every finite field is perfect. 
Further results along these lines are given in the exercises. 


Reduction Modulo p 
Finite fields have surprising applications to algebraicity over Q. Already in 
Chapter 20 we proved the basic irreducibility results of Gauss and Eisen- 
stein by passing modulo p to Z,. In Example 26.5 we shall proceed one 
step further, using this technique to obtain the minimal polynomial of any 
primitive nth root of 1. 


Exercises 
1. Writing GF(4) = {0,1,a,a7}, show directly 1+ a = a’, proving 
there is a unique irreducible polynomial of degree 2 over GF(2). 


2. Factor 2° 


— x into irreducible factors over Z2, and conclude that 
there are exactly two irreducible polynomials of degree 3 over Zo. 

3. Let n,(t) denote the number of irreducible polynomials of degree t 
over Z,. Remark 10(4) implies that n,(t) < a However, a closer 
examination shows }74),%p(d)d = p'; conclude by Mébius inversion 


(Exercise 2.12) that 


_ walt p(t/d)p* 


t 


a 


. Using the formula from Exercise 3, show that n,(t) # 0 for each t, 


thereby providing an independent proof of Theorem 9, which does 
not rely on splitting fields. 


. GF(p’) is isomorphic to a subfield of GF(p*), iff s|t. 
. Reprove Remark 11, via Exercise 23.16. 
. Let n = p'. The field K = GF(p') = {0,1,@,...,a"~7} has y(n—1) 


multiplicative generators, and the minimal polynomial of each has 
t roots. This provides at least ent) 
GF(p) (and incidentally proves that t|]y(n — 1)). (For the precise 


formula, sce Exercise 3.) Display an irreducible polynomial of de- 


irreducible polynomials over 


gree 4 over GF(2), none of whose roots is a multiplicative generator 
of the multiplicative group of GF(16). 


. If a € GF(p") and f is the minimal polynomial of a, then the other 


2 3 
roots of f are aP,aP ,aP ,.... 


. For any finite field K D GF(p') show Gal(K/F) = (ot), where o is 


the Frobenius map. 


CHAPTER 26. APPLICATIONS OF THE GALOIS CORRESPONDENCE 


We are about to embark on some more fascinating applications in alge- 
bra, representing some of the high points in the history of mathematics. 
The underlying philosophy is to use the Galois correspondence to translate 
questions about intermediate fields to parallel questions in group theory, 
and then to use known results about groups (from Part I). 

The applications here include the descriptions of the constructible reg- 
ular n-gons, and an algebraic proof of the “fundamental theorem of algebra” 
(which says that C is algebraically closed). In the next Chapter we shall 
also use Galois theory to prove the celebrated theorem of Ruffini- Abel that 
there is no formula to solve the general quintic equation. 


We start with a rather straightforward application. 
Finite Separable Field Extensions and the Normal Closure 


Proposition 1. If E/F is finite Galois, then it has only a finite number 
of intermediate fields. 


Proof. The intermediate fields correspond to the subgroups of Gal(E#/F), 
which are finite in number (since Gal(#/F) only has 2!¥*¥! subsets). 0 


Note that a two-dimensional vector space V over R has infinitely many 
one-dimensional subspaces. (Indeed, identifying V with the real plane, its 
one-dimensional subspaces correspond to the lines passing through the ori- 
gin, which are infinite in number). Thus, Proposition 1 is quite surprising. 
To push Proposition 1 farther, we resort to a useful technique, called the 
normal closure, which builds up a Galois extension from a separable ex- 
tension. We borrow the idea from the proof of Theorem 24.14 (ii)=> (i). 
Clearly, one can write K = Fla1,...,a,] for suitable a; in K. Letting f; 
be the (monic) minimal polynomial of a; over F for 1 <i < t, we take the 
product f of these f;, discarding duplications, and let EF be the splitting 
field of f over F. FE is called the normal closure of K over F. Clearly, 
E/F is normal and thus Galois. (To justify the use of “the,” one shows 
that (up to isomorphism) F does not depend on the choice of a1,..., a4; cf. 
Exercise 2.) 


COROLLARY 2. If K/F is finite and separable, then it has only finitely 
many intermediate fields. 


Proof. Let FE be the normal closure of K. Then E/F is Galois, so its 
intermediate fields correspond to the subgroups of Gal(#/F'), which are 
finite in number. O 
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See Exercise 6 for a further application. The normal closure also en- 
ables us to tighten our results about separability and Galois extensions, cf. 
Exercises 7 and 8. Here is one tool for using this technique. 


Remark 3. If E/F is Galois and an intermediate field K is invariant under 
G = Gal(E/F), i.e., ok) = K for every o in G, then K/F is Galois. 
(Indeed, as in the proof of Theorem 24.20(3) we see that Gal(K/F) ~ 
Gal(E£/F)/Gal(Z/K), implying that |Gal(K/F)| = [K : F], and thus 
K/F is Galois.) 


PROPOSITION 4. Suppose K = Fla,...,a+] with K/F separable. Let 
E be the normal closure of K over F. If Gal(#/F) = {o1,...,on}, then 


E=Fl{oj(aj):1<i<n, 1 <7 < tH). 


Proof. Let Ey = Floi(aj):1<i<n,1< 7 < ¢. Then & is invariant 
under G, so Ey /F is Galois (and thus normal), implying by Theorem 24.14 
that B= &.0 


Much of the theory of normal extensions and the normal closure can 
be accomplished without separability, cf. Exercises 2 through 6; however, 
since our applications all involve separable extensions we have taken the 
more convenient route, assuming separability throughout. 


Constructible n-gons 

As our first application of the theory, let us finish the classification of 
the constructible n-gons, started in Remark 22.10. To do this, we need to 
calculate the Galois group of the extension obtained by adjoining a primitive 
root of 1. 


Example 5. Cyclotomic extensions. Let us apply these results to the the 
extension # = Q[p] of Q, where p is a primitive nth root of 1. We know 
already for n prime that #”~! + 2"~? +---+ 1 is the minimal polynomial 
of p. Our goal is to see what happens for n not necessarily prime. Define the 
polynomial f, = Ij. Buler(ny (2—P"), called the nth cyclotomic polynomial. 
Recall Euler(n) is {4 :1<k <n, (k,n) = 1}. Thus {p? : 7 € Euler(n)} 
is precisely the set of primitive nth roots of 1. We shall prove for all n 
that f, is irreducible and thus is the minimal polynomial of p, and that 
Gal(#/Q) & Euler(n), so that deg f, = | Gal(#/Q)| = y(n). In the process 
we shall also find an inductive formula that enables us to compute fy, in 
terms of {fg : proper divisors d of n}. 

Indeed, let G = Gal(E/Q). Since F is the splitting field of #” — 1, 
we see by Proposition 10 that E& = Q. But any automorphism o of G 


merely permutes the primitive nth roots of 1 and thus, by Remark 11, 
fn € EP [2] = Qi]. 

We want to show that f, € Z[sx]. This follows by induction on n. 
Indeed, for any d|n note that {(p*)? : 7 € Euler(4)} is the set of primitive 


we 


7-roots of 1, so 


Sasa = Il (x — (p")), 


je Euler( 4) 


and thus, 


r-1l= I[¢ —pi)= [| faa= [| fa = fng. 
d|n d|n 


where g = Tajn., ayn fa. But fa € Z[z] for d < n, by induction, and is 
clearly monic, so g € Z[a] and is monic. Hence f,, € Z[z] by Theorem 20.8. 

To prove f,, is irreducible we appeal to reduction modulo p. Suppose 
fn were reducible. Then the minimal polynomial g of p would be a proper 
factor of f, in Z[z], so some other primitive root p” of 1 would not be a 
root of g, for suitable /& prime to n. Write k = p,...pz as a product of 
prime numbers. Letting pp = p and p; = p?'?*, a primitive nth root of 1 


for each i, we see that pj41 = p;’t'. Furthermore, for suitable 7, p; is a root 
of g, and pi41 is not a root of g. Replacing p by p; and taking p = pi41, 
we may assume p is a root of g, but p? is not a root of g. Note that p does 
not divide n. 

Writing f = gh, we see that p? is a root of h; hence pis aroot of h(x”), 
implying that g and h(a”) are not relatively prime, and thus g divides h(a”). 

Let denote the canonical image in Z = Z/pZ. Then 2” — 1 € Z[z] is 
separable since it is relatively prime to its derivative nx”! (for p does not 
divide n). Thus f,, is separable. On the other hand, f, = gh; hence g and 
h are relatively prime, by Remark 23.21, implying g and h? are relatively 


prime. But the Frobenius map fixes Z,: if h = 0; mj,x* then 
AP = So mia = So mix?! =h(x?), 


so by the previous paragraph g divides h?, contradiction. 

Having concluded that f,, is irreducible, we see that F is the splitting 
field of f, over Q, and thus |G] = deg f, = y(n). It remains to show that 
G = Euler(n). Note that for any k in Euler(n) there is a, in G given by 
p+ p*: since these automorphisms all act differently on p we have y(n) 
distinct automorphisms, so G = {o, :& € Euler(n)}. Furthermore, k 4 07, 


is a 1:1 correspondence from Euler(n) to G, which is a group homomorphism 
(and thus an isomorphism), since for any k, k’ in Euler(n) we have 


t 


anon (p) = on(p™) = (p% )* = p™ = onn(p). 


We already know the structure of Euler(n) from Exercise 18.4, and thus we 
know the Galois group of every cyclotomic extension. 


In the course of this proof, we have shown that «” — 1 = Tain fa. For 
example, for n = 6 we see 


fifefsfe = 2° —1= (2° -1)(2° +1) 
= (x -1)(2? +241)(241)(2? —241) 
= fifsf,:-(2 —-2+1), 


so we conclude fg = 7? — x +1. 


THEOREM 6. A Galois field extension E'/F is quadratically definable iff its 
Galois group is a 2-group. 


Proof. By the Galois correspondence, there is a chain of quadratic field 
extensions F = Fy C F, C Fy C--: C F; = LE, iff there is a corresponding 
chain of subgroups Hy) = Gal(#/F) > Hy D Hy--- D Hy = {e} with 
each [H; : H;41] = 2. But any 2-group has such a chain of subgroups, by 
Proposition 12.8. 0 


THEOREM 7. A regular n-gon is constructible iff n is a product of a power 
of 2, together (possibly) with distinct odd prime numbers each of the form 
oe aie 


Proof. By Remark 22.10 the regular n-gon is constructible iff the primitive 
nth root p of 1 is constructible (as a complex number). By Example 5, Q[p] 
is Galois over Q with Galois group G of order y(n). So, by Theorem 6, p is 


t 


constructible iff |G| is a power of 2. Writing n =p{'...p;* with py....,p: 


distinct primes, we see 


IG] = y(n) = v(py").-- (pe), 


so each p(p;") must be a power of 2. 

Now »(2") = 2%~! is always a power of 2. On the other hand, for 
pF 2, p(p") = (p—1)p"~? is a power of 2 precisely when u = 1 and p—1 is 
a power of 2, yielding the desired result in view of the easy Exercise 22.8.0 


Incidentally, prime numbers of the form n = 2?" +1 have an interesting 
history of their own; they are called Fermat primes, in honor of Fermat’s 


only known mathematical mistake — he thought every number of this form 
is prime. In fact, nis prime for ¢ = 0, 1, 2,3, 4, (n respectively is 3,5, 17,257, 
and 216 +1 = 65537), but Euler proved 237+1 = 641 x 6700417 is not prime, 
and no other Fermat prime is known. It is an open question whether or not 
there are infinitely many Fermat primes, and any new Fermat prime must 
be at least 10°°4°°, so Theorem 7 is somewhat tantalizing in this regard. 


The Fundamental Theorem of Algebra 

A field F is called algebraically closed if every polynomial over F splits 
into linear factors in F[a]. Algebraically closed fields are very useful, and 
form the foundation of algebraic geometry. Our final application of this sec- 
tion is to prove the fundamental theorem of algebra, that C is algebraically 
closed. The following observation comes in handy. 


LEMMA 9. A field F is algebraically closed iff there is no finite field exten- 
sion K D> F. 


Proof. (=) Otherwise take a € K \ F. Then [F[a] : F] is the degree of the 
minimal polynomial of a, which must be 1, so a € F, contradiction. 

(<=) If f € Fla] is irreducible then K = F[sx]/(f) is a finite field 
extension of F, so 1=|[K: F] =deg f.O0 


Ironically, most proofs of the fundamental theorem of algebra come 
from analysis (such as the theory of analytic functions), but E. Artin found 
an elegant proof that is purely algebraic modulo the following basic topo- 
logical fact: 


LEMMA 10. Any polynomial f € R[s] of odd degree has a zero in R. 


Proof. One may assume f is monic. Write f = 7", a;x' with a, = 1, and 
take ¢ > max 9<i<n{|ai|}. Then 


f(—ne) = (—ne)" + Ss a;(—ne)' < (—ne)" + % c(ne)' 


< (—ne)” +ne(ne)""* <0, 


so f(—ne) < 0, and likewise f(nc) > 0. Since f is a continuous function, 
which takes on a negative and a positive value, the intermediate value 
theorem shows that f(a) = 0 for some a in the interval (—ne,+nc). 0 


COROLLARY 11. Every irreducible nonlinear polynomial in R[x] has even 
degree. Consequently, every proper finite extension of R has even degree. 


Proof. The first assertion is clear. To prove the second assertion, suppose 
K DR had odd degree. Taking a € K \R yields deg a = [R[a] : R] which 
divides [K : R] and thus is odd, a contradiction. 0 


Now for the main result. 


THEOREM 12. (The Fundamental Theorem of Algebra.) C is algebraically 
closed. 


Proof. Assume on the contrary there exists a finite extension K D C 
(cf. Lemma 9). Take a € K \ C; let f be the minimal polynomial of a, 
and let F be the splitting field of f over C. Then F is the splitting field of 
(2? +1)f over R, so F/R is Galois with some Galois group G. 

Let S be a Sylow 2-subgroup of G. Then [E£% : R] = [G: S] is odd, 
and thus equals 1 by Corollary 11, i.e, E5 = R, so S = G by Proposi- 
tion 24.17. But then G is a 2-group, so its subgroup Gal(/C) is also a 
2-eroup. By Theorem 6, there is a quadratic field extension of C which, 
by Example 21.12(iv), can be obtained by adjoining the square root of a 
suitable element of C. Thus, we can reach a contradiction by showing that 
Vz € C for cach z in C. This can be seen algebraically, cf. Exercise 12, but 
i@ 


is obvious if we write z = ae’ in polar coordinates (a € Rt); then clearly 


Jz= Jae? .O 
COROLLARY 13. Cis the only proper algebraic extension of R. 


Proof. Suppose E is an algebraic extension of R. Then E[,/—1] is algebraic 
over R[ /—1] = C, so E[/—-1] = C. Hence R C EC C; we are done since 
[(C: RJ) =2.0 


Exercises 


1. (Primitive root theorem revisited.) Show that any finite, separable 
field extension K/F has the form K = Fla] for suitable a in K. 
(Hint: For F finite use Chapter 25; for F infinite use Steinitz’s 
theorem (Exercise 23.7).) 


The Normal Closure 

2. The normal closure is uniquely defined. (Hint: Use the uniqueness 
of the splitting field of a given polynomial.) 

3. Suppose F is the splitting field of a polynomial f over F. For any 
fied K D E and any homomorphism o: F > K such that o( F) = F, 
show that o(F) = E. 

4. Let m = number of F-injections from K to its normal closure E. 
Then | Gal(K/F)| <m < [Kk : F]. 

5. What is the normal closure of the field Qla!/”] over Q? 

6. The normal closure of a quadratically defined field extension over F 
remains quadratically defined over F. (Hint: each new root can be 
obtained by continuing the chain of quadratic extensions.) Conclude 


10. 


1G 


12: 


13. 
14. 


15. 


that any constructible number is contained in a Galois extension 
of Q, whose Galois group is a 2-group. 


Separability Degree 

Define the separability degree [K : F], of a finite extension K/F to 
be the number of F-homomorphisms from K to its normal closure. 
Using the idea of Theorem 23.14, show that [K : F], < [K : F], 
with equality holding iff K/F is separable. 

For any fieds F C K C EF show [F: Fl, = [FE : K],[K : Fla. 
Conclude that E/F is separable iff E/K and K/F are separable. 
An extension K/F is normal, iff every F-injection from K to its nor- 
mal closure F is actually an F-automorphism of K, iff the number 
of F-automorphisms of K is [K : F].. 


The Algebraic Closure 

(Easy algebraic proof that C is closed under taking of square roots.) 
Given c = a+5i for a, bin R, solve a+bi = (u+iv)? = u? —v? +2uvi 
for suitable u,v € R. (Hint: If 6 = 0 then the assertion is obvious, so 
assume 6 # 0. Matching real and complex parts yields a = u? — v? 
and 6 = 2uv. Substitute v = + to obtain a quadratic equation in 
u? which has a positive solution.) 

A field F is an algebraic closure of F if F is algebraically closed and 
is an algebraic extension of F. Prove that if F D F is algebraically 
closed, then any finite extension of F is isomorphic to a subfield of 
FE. Conclude that the algebraic closure of F in F is algebraically 
closed, and thus is an algebraic closure of F. 

The algebraic closure of any field is infinite. (Hint: Lemma 9.) On 
the other hand, any infinite field has the same cardinality as its 
algebraic closure. (Hint: Compare with Exercise 21.6.) 

What is the algebraic closure of Q? 

Viewing GF(p*) C GF(p") in the natural way when s|t, define the 
fied GF(p™) to be the union of these fields, modulo an equiva- 
lence class identifying the appropriate elements. Prove this is alge- 
braically closed, and thus is the algebraic closure of Z,. 

Rather than rely on the fact that C is algebraically closed, one can 
prove, in general, that any field has an algebraic closure. The idea 
is to abstract the proof of Exercise 16: Let F be the field obtained 
by formally adjoining all roots of irreducible polynomials over F. 
This is obtained by taking the disjoint union and making the correct 
identifications via an equivalence relation. There are some delicate 
points in set theory that we are sloughing over. 


CHAPTER 27. SOLVING EQUATIONS BY RADICALS 


In this chapter we present the crowning application of Galois theory: the 
determination, in terms of group theory, of the solvability of equations by 
formulas involving nth roots. To make life easier, we shall assume through- 
out that F is a perfect field, so that we do not need to worry about sepa- 
rability. 


Root Extensions 


Definition 0. A field K is a root extension of F (of degree n) if K = F[a] 
for suitable a € K such that a” € F. 


Note that this definition depends on the correct choice of a. 


Example 1. If char(F) 4 2, then any quadratic extension of F is a root 
extension of F of degree 2, by Example 21.12 (iv). 


For degree > 2 the situation is more complicated, so we want to be 
able to iterate the extraction of roots, as in the next definition. 


Definition 2. Suppose K/F is a field extension. A root tower of height < n, 
for K over F, is a chain of fields 


K=Kj DK, D:D Ki: =F (1) 


(for suitable t) such that K; is a root extension of Kj41 of degree <n, for 
each 7. A polynomial f € F[z] is solvable by radicals of height <n, if the 
splitting field F of f (over F) is contained in a field K that has a root tower 
of height <n. 


Letting a = a” € F we can view a as %/a, thereby justifying the 
terminology “root extension”; the term “radical” is a variant for “root,” 
borrowed from the French. 

By Example 1, every constructible number is contained in a root tower 
over Q. On the other hand, it is easy to find a constructible number, such 
as 2+ V3. which is not contained in any root extension of Q. 


Digression 3. There are two subtle points that should be noted. 

(i) When one writes ~/a, for a € R, one conventionally means the 
“principal” root. Thus, according to Definition 0, Q[2] is a root extension 
of Q (since i* = 1), although, strictly speaking, YI should be interpreted as 
1, not 7. On the other hand, /—1 can only be +7, so viewed in this way we 
see that 7 must be in the root extension Q(,/—1) of Q. We shall define this 
notion precisely in Exercise 24. The situation becomes much more delicate 
when we consider higher roots of 1. Gauss proved that all the cyclotomic 
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polynomials are solvable by radicals, even in this stricter sense. By using 
Definition 0, we bypass his deeper theory of cyclotomic extensions. 

(ii) In the definition of solvability by radicals, one often can take K = 
FE, for example, when F contains enough roots of 1, cf. Proposition 9 
below. In general, however, one might not be able to take K = EF, in which 
case one needs a technical lemma which shows that the normal closure of 
a separable extension with a root tower also has a root tower. The idea 
of the proof is quite clear, but the notation is cumbersome. In order to 
avoid complications, we set aside this technical lemma as Exercise 5, but it 
is needed to prove one direction of Theorem 11. 


Example 4. Let f = «” — 1. Its splitting field over Q is Q[p] (where p is 
a primitive nth root of 1), which is evidently a root extension of degree 
<n. Note that Gal(Q[p]/Q) = Euler(n) is Abelian. (More generally, if 
char F = 0 then Gal(F[p]/F) is Abelian, where p is a primitive nth root 
of 1; cf. Exercise 24.5.) 


Digression 4'. Let us pause for a moment to consider the hypothesis that 
F contains a primitive nth root p of 1. Then the polynomial f = 2” —1 has 
n distinct roots, namely 1,p,...,p”~1, and thus is separable. In particular 
04 f' = nz"“1, so n is not divisible by char F. Thus, our hypothesis on 


primitive roots of 1 implies the hidden hypothesis + € F. 


Example 5. Suppose f = x” — a@ for a in F, and F contains a primitive 
nth root p of 1. Take EF = Fa], where a is a root of f, t.e., a” =a. Then 
E is the splitting field of f, since the other roots p‘a also lie in E, so f is 
solvable by radicals. Moreover, we claim Gal(#/F) is Abelian. Indeed any 
automorphism of & sends a to another root p‘a; if o,7 € Gal(E#/F), then 
o,T are given respectively by a p'a, at} pia for suitable i, 7, and thus 


or(a) = o(p!a) = pip'a = pita = ro(a), 


implying ot = To. (In fact Gal(#/F) is cyclic, cf. Exercise 2.) 


Example 6. Conversely, suppose K/F is Galois with [K : F] =n prime, and 
suppose F contains a primitive nth root p of 1. Then G= Gal(K/F) = (c), 
for suitable ¢ in G, since |G] = nis prime. We shall show that K = F(a!/”) 
for suitable a # 0 in F. To see this, we must prove o(a) = pa for suitable 
ain K,1<2i< n. Then a ¢ F, but o(a") = p"a” = a”, implying 
a” € K& = F. as desired. Thus, we need prove 

PROPOSITION 7. Suppose o is an automorphism of arbitrary order n in a 


field K, and suppose F = K° contains a primitive nth root p of 1. Then 
there is some a in K \ F such that o(a) = p'a for suitable 1 <i <n. 


Proof. Take b € K \ F and let 
a;=b+p to(b) +p Mob) tp tg" 1b), Osten: (2) 


Then o(a;) = p’a;, so ao € F, and it suffices to show that a; ¢ F for some 
1<a<n. Actually, we shall prove more generally that 


n-1 gh n-1 
ob) =o = a, € \> Fai, (3) 
1=0 1=0 


for each k, yielding the desired result (since otherwise 6 € F, a contradic- 
tion). 
To verify (3), note that (2) can be written more concisely as 


n—-1 
a; = S- p %a4(b). 
j=0 


Moreover, yee pes — = ras = 0 for any 1 < u < n, so we see 
n-1 : n-1 ' n-1 ; ; 
Fras = 5 ot Svs 
i=0 j=0 i=0 
n-1 
=o*(b) S°1=no¥(b), 
i=0 


as desired. O 


This proof is full of interesting notions. First of all, the a; in (2) are 
called the Lagrange resolvents of b, a key tool in computing root extensions, 
cf. Exercise 12. (Actually a more general proof of Proposition 7 can be 
obtained via Exercises 20 and 21.) 


Solvable Galois Groups 

The relation between solvability by radicals and the Galois group is our 
principal theme. Recall the definition of “solvable group” from Chapter 12. 
The next lemma is fundamental to the sequel. 


LEMMA 8. Suppose F C FE C K, with K/F and E/F Galois. Then 


Gal(K/F) is solvable, iff Gal(K/E) and Gal(£/F) are solvable. 
Proof. Gal(E/F) = Gal( K/F)/ Gal( K/E), so apply Theorem 12.9. 0 


PROPOSITION 9. Suppose F is a splitting field of f over F, and F contains 
a primitive mth root of 1 for each 1 < m <n, wheren = [E : Fl. The 
following are equivalent: 

(i) f is solvable by radicals of height <n; 

(i) #/F has a root tower; 

(iii) Gal(#/F) is solvable. 


Proof. ((ii) => (i)) By definition. 
((i) => (ili)) We have a root tower 


K=K,2K,2>-:-DK,=F. 


of height < n, with K D EF; we need to show that Gal(K/F) is solvable, 
since then Gal( £/F) is solvable. We proceed by induction on t, noting the 
result is true by Example 5 if t = 1. 

More generally, K;_1 is Galois over F, with Abelian Galois group. Now 
Gal( K/K+_1)<Gal(K/F), and Gal( K/K_1) is solvable, by induction on t. 
Hence Gal( K/F) is solvable, by Lemma 8. 

((iii) => (ii)) Let G = Gal(E#/F). By Proposition 12.12, the solvability 
of G gives us a subnormal series G = Go D G1 D--- D fe}, with each 
[G;_1 : Gj] a suitable prime number p; < n. Letting K; = Ei we see that 
each [K; : Kj-1] = p;, yielding a root tower of height <n, by Example 6.0 


Even in the absence of primitive roots of 1, the same theory can be 
used, merely by adjoining roots of 1 at the onset. This is easy in charac- 
teristic 0, but in characteristic 4 0 one must be careful - one cannot adjoin 
a primitive nth root of 1 if char F divides n, cf Digression 4’. Thus, we 
introduce the assumption char(F’) { n!, cf. Exercise 1, which automatically 
holds in characteristic 0. 


Example 10. Suppose f = x” —a for ain F, but with F not containing a 
primitive nth root p of 1. Then the splitting field of f is E = Flp.a!/"), 
so letting K = F[p] we see that EF D K D F displays f as being solv- 
able by radicals. Furthermore, Gal(/F) is solvable, since Gal( K/F) and 
Gal(#/K) are Abelian. 


THEOREM 11. Suppose F is the splitting field of f over F, and char(F) { n!, 
where n = deg f. Then f is solvable by radicals of height < n, iff Gal( E/F) 
is solvable. 


Proof. Let L be the splitting field of (x™ —1)f over F, a Galois extension 
of F, by Theorem 24.14. Note L = E[p], where p is a primitive n!-root of 1. 
(=>) Suppose F = Ey D FE, D +--+: D FE; = F is a root tower of height 
<n. Then 
L= Eolp] D Fale] 5 --- 3 Eile] = Flo] 


is a root tower of height <n from L to F[p], which extends down to F via 
the root tower F[p] D Flp"] D F[p™"-)] D--+-D F of height <n. 

By Proposition 9, Gal(Z/F[p]) is solvable. But F[p]/F is Galois with 
Abelian Galois group, so Gal(L/F) is solvable, by Lemma 8; hence its 
homomorphic image Gal( H/F) is solvable. 


(<) Let G = Gal(E/F). Then Gal(E'/F)/ Gal(£’/E) & G; replacing 
E by E' we may assume p € &. Then Gal(#/F[p]) C Gal(#/F) is solvable, 
so Proposition 9 shows there is a root tower of height <n from E to F[p]. 
Continuing the root tower from F[p] to F (as above) shows f is solvable by 
radicals of height <n, over F.O 


COROLLARY 12. If deg f < 4, then f is solvable by radicals. 


Proof. The splitting field of f has degree < 4! = 24, so its Galois group has 
order < 24, and thus is solvable, by Corollary 12.11. 0 


Computing the Galois Group 

In order to obtain an example of a polynomial f that is not solvable by 
radicals, we must find f whose Galois group is not solvable. Since S,, is not 
solvable for n > 5, let us find a polynomial whose Galois group is S,. This 
is rather easy to do if we do not care about the base field, cf. Exercise 3, 
but our motivation at the outset was in studying polynomials over Q, so 
it is only fair to look for an example over Q. The explicit computation of 
the Galois group G of an arbitrary polynomial can be rather difficult, and 
is usually accomplished by viewing G explicitly as a group of permutations 
of the roots of f, as follows. 


Remark 13. Recall (Summary 24.19) that if # = F[a] then any automor- 
phism of E/ F permutes the roots a, = a, ...,@,, of the minimal polynomial 
f of a, and thus we get a natural group injection Gal(#/F) > S,, by which 
we shall view Gal( #/F) as a subgroup of Sy. 


Digression 14. One can characterize certain subgroups explicitly by de- 
scribing various combinations of the roots that they fix. Here is the main 
example of this technique. 


Example 15. (The discriminant.) With notation as in Remark 13, view 


G = Gal(f/F) as a subgroup of S,,, and define 


c= {][(ai-aj)):1< 5 <i<n}. 


Any transposition in G sends c to —c. Thus, d = ¢? € F, and is called the 
discriminant of f. Furthermore, the alternating group A,, fixes F(c), so it 
follows that Gal(#/F(c)) = GN A,. In particular, d is a square in F iff 
G < Ay. 

One can describe the discriminant d in terms of the coefficients of f. 
For example, let us take n = 2. If f = 27 + 82 +7 can be factored over E 
as (%@ — a4)(a — a2) then 


d= (a) — a1)" = (42+ a1)? — daa, = fp? - 4y, 


the familiar discriminant of a quadratic polynomial, which is a square over 
F iff f already splits (i.e. the Galois group is Az, which is trivial). 

The discriminant can be computed in general via Exercises 20.12 and 
20.13, and enables us to rederive the classical formulae for equations of de- 
grees 3 and 4, cf. Exercise 11. Furthermore, the discriminant also provides 
a great deal of information about roots, cf. Exercises 8ff. 


Let us return to the main theme. We assume F C R. (In particular, 
we are in characteristic 0, with no problems about separability.) Then 
any f € F[z] factors (over R) into irreducible polynomials f,... ft, each 
of which has degree < 2, by Corollary 26.13. Each root a; of f; can be 
viewed in C and is also a root of f. If a; ¢ R, then, by the quadratic 
formula, its complex conjugate @; is also aroot of f; and thus of f. Thus the 
nonreal roots of f (if they exist) pair off as complex conjugates, and complex 
conjugation in C induces an automorphism of order 2 on the splitting field 
of f over F. 

To apply this to solvability we need one more result from group theory. 


PROPOSITION 16. Suppose p is prime. Then S, is generated by any trans- 
position tT and any cycle o of order p. 


Proof. Reordering the indices, we may assume 7 = (1 2). Replacing o by a 
suitable power, we may assume o1 = 2; renumbering the other indices we 
may assume og = (1 2 ...p). But then we are done by the straightforward 
Exercise 5.8. 0 


CoROLLARY 17. If f € Q[z] is irreducible of prime degree p and has pre- 
cisely 2 nonreal roots in C, then the Galois group of f is Sy. 


Proof. Since f splits over C, we can view the splitting field E of f as a 
subfield of C. By Remark 23.12, p|[Z : F], implying G = Gal(E£/F) contains 
an automorphism @ of order p, by Cauchy’s theorem. Viewing G in S,, we 
see that o corresponds to a cycle of order p. On the other hand, complex 
conjugation switches the two nonreal roots, leaving the real roots fixed, and 
thus yields an automorphism 7 of F corresponding to a transposition in S). 
Hence o,7 generate S,, proving Gal(£/F) = S,.0 


Example 18. Suppose f = x° — pqx + p, for p,q € N and p prime. By 
Eisenstein’s criterion, f is irreducible. On the other hand, we can determine 
precisely how many roots of f are real. One method is to observe that 
f' = 52+ — pq, which is positive for |x|* > Ft and negative otherwise. Since 
f(0) = p > 0, the intermediate value theorem implies that f has a real 
negative root. Moreover, since the graph of f has only two turning points, 
namely at 2, = — fet and #2 =+ fet, we see that f can have at most two 


more real roots; in fact, f has three real roots iff f(z) > 0 and f(z.) < 0. 
For 1 = 1,2, 


pq 4 
f (ai) = 2i(2} —pa) + p= ni(= — pq) +p = (—zaei + Vp. 


Clearly f(a) > 0, and for f(2%2) < 0 one needs $qr2 > 1. Surely this 
holds for g > 2 and p > 2 (for then, 4902 >> 8/4 > 1), so in this case 


we conclude that f has precisely three real roots and two nonreal roots; 


therefore Gal( f) = Ss, by Corollary 17. 


Besides providing negative criteria for solvability by radicals, Galois’s 
theory enables one to obtain striking positive results, such as the following 
lovely theorem of Galois: 


THEOREM 19. Suppose f € F[z] is irreducible of prime degree p, and 
@1,...,@, are the roots of f in a splitting field E. Then f is solvable by 
radicals iff E = F[a;,aj] for each i F 3. 


The proof involves a fair amount of computation concerning transitive 
solvable subgroups of S,, and is given in Exercises 16 through 19. 


Exercises 


1. If char F { n, then one can find a field extension # of F containing 
a primitive nth root of 1. (Hint: f = 2” — 1 is separable. Thus, 
any splitting field of f contains n distinct nth roots of 1, and one 
of these must be primitive, by a counting argument.) 

2. In Example 5 show that Gal(£/F) actually is cyclic. (Hint: Identify 
Gal(E£/F) with a subgroup of the cyclic multiplicative group of nth 
roots of 1.) 


Prescribed Galois Groups 

3. A quick example of a field extension whose Galois group is Sj. 
Let 21,...,%, be commuting indeterminates over Q, and let # be 
the field of fractions of Q[a1,...,2,]. Every permutation o in S,, 
defines the corresponding automorphism of Q[21,...,%,] given by 
Li &oi; this automorphism extends naturally to an automorphism 


of B, via a(S) = one Thus S,, can be viewed as a group of 
automorphisms of E; then E/E%” is Galois with Galois group Sy. 
4. For any finite group G, find a Galois field extension having Galois 
eroup G. (Hint: View G as a subgroup of S,, where n = |G|, and 


apply the Galois correspondence to Exercise 3.) 


Root Towers 

5. If K/F has a root tower of height < n and F is the normal closure of 
K over F, then E/F has a root tower of height < n that terminates 
with the original root tower of K/F. (Hint: Take a root tower 
K=K) > Ki D-:- > Ki =F (for suitable t) of height <n, ie., 
K; = Ki41(a;) with a7 € Ky41 and n; < n, for each i < t. By 
Proposition 26.4, EF = Flaj(a;):1<i<n,1< 37 < #), where 
Gal(Z/F) = {o1 =1, o2,...,0n}. Put Lo = F and, for 1 <i<n, 
L, = Ko.(K)...0;(K) = Lj;-10;(K), taken in BF. Then Ly = K 
and L,, = FE. By taking an analog of the original root tower at each 


stage, show that L;/L,;-, has a root tower of height <n for each 2; 
putting them together then yields the desired root tower for E/F.) 

6. Suppose K/F is Galois of degree p*. Then for any series of exten- 
sions K = Ky D K, D> -:: D Ky = F such that each p; = [K;_1 : K;] 
is prime, each K;_,/K; is necessarily Galois. (Hint: Gal( K/K;_1) 
has index p in Gal(K/K;), so is a normal subgroup.) 

7. Show, in general, (for p,q > 0 and n odd) that the polynomial 
f = 2” — pqx +p always has one or three real roots, and has three 
real roots iff p > weet (Hint: Carry out reasoning analogous 
to Example 18 to arrive at f(z.) = 0 iff z_ > take the 


(n — 1)-power, noting 2j' = 24.) 


aay 


Finding the Number of Real Roots via the Discriminant 
Assume that F C R in Exercises 8 through 12. 
8. The number of real roots of the quadratic polynomial x2? + Bx + ¥ 
is determined by its discriminant. 
9. Suppose deg f = n, and f has m real roots. Then n — ™ is an even 
number, which is divisible by 4 iff the discriminant of f is positive. 


(Hint: Let t = 45™, the number of pairs of nonreal roots. Compute 
the discriminant d = J[(a; — a;)?, by considering separately the 
various cases of whether or not a;,a; are real: 

If a;,a; are both real, then (a; — a;)? > 0. 

If a; but not a; is real, then (a; —a;)(a; — @;) = |a; —a,|? > 0, 
and as noted above, both factors occur in the computation of d. 
Likewise if a; but not a; is real. 

If a; # a; are both nonreal andi ¥ J, then (a; — a;)(@ — 
\(a; — a)? > 0. 

Thus the sign of dis determined by [](a;—4;)?, taken over the 
t pairs of nonreal roots. But each a;— a; is purely imaginary, so its 


aj) = 


square is negative.) 
10. If deg f =1 (mod 4) and d < 0, then f has at least three real roots 


11. 


12; 


13. 


14. 


15. 


16. 


17. 


18. 


and two nonreal roots. 

Calculate the discriminant of the polynomial «” —pqz+p, by means 
of Newton’s formulas (cf. Exercises 20.12 and 20.13), and apply it 
to exercise 9; compare with Exercise 7. 

(Solving the cubic equation.) Suppose f = 23 +uxr+v for u,v € Q. 
Using Exercises 20.12 and 20.13 compute the discriminant of f to be 
4u3 + 277: by adjoining the square root of this clement one obtains 
a field Fy over which the Galois group of f has order 3 and thus can 
be solved by adjoining a primitive cube root of 1 and taking a cube 
root. Carry out the explicit computation by means of Lagrange 
resolvents and the discriminant; the result is Cardan’s formula for 
solving the cubic equation. 

If f € Fiz] has degree 4, and a1,a2,a3,04 € E are roots, show that 
the subgroup fixing the element (a1 — a3)(a@_ — a4) is precisely the 
subgroup of permutations generated by (1 2 3 4) and (1 4)(2 3), 
which can be identified with D4. 


Galois Groups and Solvability 

Suppose that f € F[z] is separable. Write f = (%— a1)...(% — Gn) 
in its splitting field #, with the a; distinct. Then f is irreducible 
over F, iff Gal(#/F) acts transitively on {a1,...,@n}. (Hint: If f is 
irreducible, then F[a,;] + F[a,] over F: lift to Gal(E/F). ) 

The only possibilities for the Galois group G of an irreducible poly- 
nomial f of degree 4 include $4, A,, D4, and the cyclic group of 
order 4. 

In Exercises 16 through 18, suppose p is a prime number. 

Any normal subgroup N #1 of a transitive subgroup of S, is itself 
transitive. (Hint: The orbits have length dividing p.) 
Let G be the subgroup of S, consisting of all permutations that can 
be defined by a formula 


jraj+b (mod p) 


for suitable fixed a,b, where 1 < 7 < p. Show that this actually is a 
subgroup of order p(p — 1) and is a semidirect product of the cyclic 
normal subgroup N of translations 7 4 7+ (mod p), by the cyclic 
subgroup of order p—1 given by 7 # aj (mod p). 

Any transitive solvable subgroup H of S, is conjugate to a sub- 
group of G containing N, with notation as in Exercise 17. (Hint: 
Using the derived subgroups, suppose H‘t-)) 4 0 and H™ = 0. 
By Exercise 16, H‘'~!) is a transitive Abelian subgroup and thus 


19. 


20. 


21. 
22. 


23. 


24, 


25. 


is cyclic, containing a cycle of length p which one may assume is 
o=(12... p),ie,o(7) =7+1 (mod p), for each 7. Then N C H. 
For any T in H, one has trat~! = o% for a suitable number a, so 
t(j +1) =o(3) +4; letting b = 7(0), conclude 7 € G. 

Prove Theorem 19 by means of Exercise 18. (Hint: Exercise 18 
implies that a transitive subgroup of S;, is solvable iff no element 


# (1) fixes any two i # j. Show that Gal(#/F) is solvable iff 
Gal(£/F[a;,a;]) is trivial for each 2,7.) 


The Norm and Trace 

(Hilbert’s Theorem 90.) Given a field extension K/F anda € K 
define the norm Nx /p(a) to be Tj cqa(Kspy 7(4). Note that Nxyr 
is a linear character from K to F, in the sense of Exercise 24.7; 
also N(aa) = a” N(a), for a in F, where n = |Gal(K/F)|. Show 
that if K/F is Galois with cyclic Galois group (c) and if N(a) = 1 
then a4 = otiy for suitable 6 in K. (Hint: This is an elaboration 
of the idea of Proposition 7. Let y; denote the map cc’. Then 
{1,¥1.---Xn—1} are independent linear characters, so there is ¢ such 
that b = >, ao(a)...0* +(a)c! 4 0.) 

Reprove Proposition 7 as a Corollary to Exercise 20. 

Given a field extension K/F and a € K, define the trace Tx ;p(a) 
to be Vy ecai(K/r) 7(@)- Note that Tx/p is a nonzero linear trans- 
formation of vector spaces. Show that if K/F is Galois with cyclic 
Galois group (7) and Tx,;#(a) = 0 then a = o(b) — b for suitable b 
in K. (Hint: This is similar to Exercise 20.) 

Suppose that K/F is Galois of prime degree p, and char F = p. 
Then K = F[b] for some b € K such that b? — b € F. (Hint: Apply 
Exercise 22 to the element 1 to find 6 such that o(b) — b = 1.) 

Say K/F is a strong root extension if there is a in F and suitable n 
such that K = F[p %/a] for every nth root p of 1. Define strongly 
solvable by radicals analogously to Definition 2, and prove that for 
each n < 11 the nth cyclotomic polynomial is strongly solvable by 
radicals. (Hint: Compute the Euler groups.) 

For n = 11, the problem becomes more difficult. Nevertheless, 

Gauss proved that all cyclotomic polynomials are strongly solvable 
by radicals; thus “strongly solvable by radicals” and “solvable by 
radicals” are the same. 
Suppose p is a primitive mth root of 1, and 71,...,7, are the nth 
roots of 1. Then »x/7j, 1 <7 <n, are the mnth roots of 1. ( Hint: 
2" —1 = ]]i_,(2™ —j).) Thus, in proving Gauss’s theorem, one 
may assume n is prime. 


APPENDIX A: TRANSCENDENTAL NUMBERS: € AND 7 


Although our effort in field theory has been concentrated in studying alge- 
braic numbers, “most” complex numbers are transcendental over Q! (The 
argument was sketched in Exercises 21.2 through 21.5.) Strangely enough, 
the task of finding even one specific transcendental number is considerably 
more difficult. Many of the proofs of transcendence of particular numbers 
rely on the fundamental theorem of calculus: If f is a continuous function 
on the interval [a,b] with continuous derivative f’ on (a,b), then 


f(b) — fla) = (b— a) f'(c) 


for some c in (a,)). 

The basic idea in the application is as follows: Suppose a is algebraic 
over Q. Then a satisfies a suitable polynomial f € Z[a] of degree t > 0. 
Consider a closed interval J containing a, but no other root of f, in its 
interior. Then |f’| is bounded on I by some number d; since f(a) = 0 we 
have 


[f(8)| = [f(b) — Fla)| < dlb — al 


for any 6 in J. On the other hand, taking 6} to be rational, say 6 = 2, 
we see that 0 4 f(b) € n~*Z, implying |f(b)| > n7*. Thus, we have a 
contradiction if given any t,d € N* we can find b = = such that 


mm 
n 


n-*>d|b—al. (1) 


This method is applicable to numbers defined by series that converge very 
rapidly. 


Example 1. (Liouville’s Number.) a = S>~, 107“! is transcendental. In- 
deed, in the above notation take b = 37" _, 10-“", where v is to be deter- 
mined in terms of t and d. Then n = 10™ and |b — a] = 0%, 10°" < 
2-10-+!, So for (1) to hold it is enough that 


10. ss oat EN rae, AOE Sa, 


This is certainly true when v > 2d+t. 0 


Transcendence of e 

For more familiar numbers, such as e = 1+1+ > + a +..., amore 
delicate procedure is called for. Having succeeded so far using calculus, let 
us begin by extracting the essence of a polynomial in terms of calculus. 
Write f™ for the i-th derivative of f. Note that in what follows we are 
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sloughing over problems of convergence, which could be bypassed altogether 
by dealing with formal power series (cf. Exercises 16.8ff). 


Remark 2. An infinitely differentiable function f:R — R is a polynomial 
of degree n (over R) iff f+!) = 0. (Indeed, (=) is obvious, and (€) is 
clear by taking the antiderivative n + 1 times.) 

Define f() = f. Then, given any polynomial f(z) = hear Or", we 
can define 


FES pO ape pO gg fOD 4 po, and ty = f(0). 


1=0 
The map f + f defines an R-linear transformation R[x] > R[:]. 


Remark 3. 7; = 3x 


nao Qutl!, where f = sk a,x". (Indeed, it is enough 


to check this for a monomial f = x“, in which case tr = f(0) = u! 
anti 


Recalling ef =1424+H4-- +94 255 +..., we next define 


7 i x 


tilt) = at Gee) 


Fs ce (2) 


and note that |6,,(x)| < el#!. 
Now consider f = 2”. Then f = n(n—-1)...(n—u+1)2"~“, implying 
Tre* =nle* =n! +nlat (n(n—1)...2)2? +---+na" +2" (3) 


; ee (Se at) 


= f+2"6,(z). 


Thus, for general f = 7", a,2", we conclude 
Tet = ft PS Ay tS (2). (4) 


Furthermore, (2) implies | 37"_, auc“du(e)| < | Xo, aue"lel*! = |f(e)Jel*! for 
any c € R. In order to use (4) to its fullest we need a way to estimate Ty 
for certain polynomials f. 


LEMMA 4. Suppose h € Z[a], and define 


f(z) = A(x), g(a) = 


(n — 1)! 


Then Tf,Tq are integers; 
(i) Tr = h(0) (mod n); and 
(ii) tT, =0 (mod n). 


Proof. Write h= eae Ayt", A, € Z. Then 


=o ee (n+u-— 1)! 


(n— 1)! 
t 
=ao+ So ayn(nt1)...(n+u—1); 


ual 


by inspection h(0) = a9 = Tp (mod n). 


=Slayn(n+1)...(nt+u) 30 (mod n). O 


= (n+ u)! 
7? =) (n—1)! 
THEOREM 5. e is transcendental over Q. 


Proof. On the contrary, suppose e is algebraic over Q. Then there are (3, 
in Z, with Bo # 0 and 0”, B,e” = 0. For any polynomial f = J a,2* 
in Qa] we have (by (4)) 


0=Tf eae” = S Bute” = S- Bul f(v)+>> Ay0"6,(v)) = $1 +524+S3, 
v=0 v=0 v=0 Ub 


where S, = Bots, So = NW, Buf (v), and Ss = TW, By Yo, aud" Su(v) (80 
that [9] < Yo, ull f(v)le'"). 

We want to find a polynomial f for which S, is a nonzero integer 
mod p, S, = 0 (mod p), and |S3| < 1; this would yield a contradiction, 
since their sum could not be 0. We take the “Hermite polynomial” 


-1 


(p—1)! 
where p is a sufficiently large prime number. 

|S3| 3 0 as p > oo: indeed, |f(v)| 3 0 as p > co, whereas f, and e!”! 
are independent of p. 


f= (2 —1)?(a — 2)? ...(2—m)?, (5) 


Furthermore, by Lemma 4, S$; is a nonzero integer # 0 (mod p) when- 
ever p > Bom. 

Finally, to show S: = 0 (mod p), put f,(x) = f(a@+v) = a2) 
for a suitable polynomial g,, for 1 < uv < m. Each flv) = f.(0) =T, =0 
(mod p), by Lemma 4(ii). 0 


Although rather intricate in execution, the basic idea of the proof is 
quite simple, once one manages to find the Hermite polynomial f with the 
correct properties. 


Transcendence of 7 

The proof that a is transcendental follows the same lines, but involves 
symmetric polynomials at a key step, so Exercises 20.11 and 23.10 are 
prerequisites. We want to apply the same considerations that concern the 
exponential function, so we relate 7 to e via the formula e7* = —1. Thus, 
it is natural to prove the equivalent statement: 


THEOREM 6. 72 is transcendental. 


Proof. Otherwise mi satisfies some polynomial g = Sy)", Bux", for By € Z, 
Bo # 0. Letting ay = mi, ...,a, be the roots of g in a suitable splitting 
field, we have 1+ e%! = 0, and thus 


0=(1t+e")...(1+e%") 
=1+e% +e7% +---- ee HEMET HEME H+... (6) 


Noting e“‘e%i = e743, we can rewrite the right-hand side of (6) as 


where each b, is a sum of various a;. Suitably rearranging the b,,, we have 
some n < 2” such that 6, 4 0 for 1 < uv < n and 6, = 0 for all v > n. 
Taking m’ = 2” — n, (6) becomes 


0=m' +e. 
v=1 


In view of (4), any polynomial f = So a,,2% € QIz] satisfies 


O0= Tm! + So tre? = rm! + (Fb) re 3 Ob“ 5,,(by)) = Sy +5243 
v=1 v=1 u 


where S, = Tem!, So = T"_, f(by), |S3| < TP, [Ff (b,)le!. This time 
we take 
BrP yp 


uc (p —1)! 


Then f can be written as ne ue qiz’, where the g; are expressions sym- 


(a — 61)... (a — by)”. (7) 


metric in the /ob,,....89b, and thus are symmetric in the $oa,,.... 0am 
and so are integers by Exercise 23.10. 

Now we conclude as in Theorem 5. By Lemma 4, S; is a nonzero 
integer. Take p > S$. As before, Sj is an integer = 0 (mod p) and $3; > 0 
as p > oo, contrary to S, + S)4+ 5; =0.0 


COROLLARY 7. 7 is not constructible, and, in particular, one cannot square 
the circle by means of compass and straight edge. 


APPENDIX B. SKEW FIELD THEORY 


These notes have dealt almost exclusively with commutative rings, in part 
because of the greater ease in dealing with one-sided ideals than with two- 
sided ideals. However, noncommutative rings do arise naturally in many 
mathematical contexts, and should also be considered. In harmony with 
one of the themes of these notes, we shall apply certain noncommutative 
rings (namely skew fields) to elementary number theory, although there are 
several deep connections to geometry and other subjects that are beyond 
the scope of these notes. Our brief excursion into skew fields is motivated 
by the following theorem: 


LAGRANGE’S FouR SQUARE THEOREM. Every natural number is a sum 
of four squares. 


The reader will notice at once the similarity to Fermat’s theorem con- 
cerning primes of the form a? + 67, which equals (a + bi)(a — bt) in Z[i]. So 
we would like to find a similar way to factor a? + b? + c? + d? Of course 
we need too many square roots of —1 to work in C (or indeed in any field), 
and instead must seek our solution in a skew field. (Recall the definition 
from Chapter 13.) In order to proceed, we need noncommutative general- 
izations of some of the notions described in earlier Chapters. But first we 
need to know more about the nature of commutativity. We say elements 
a,b commute if ab = ba. 


Remark 0. Suppose a € R is invertible. If a commutes with bin R, then a7! 


also commutes with 6. (Indeed, a~1b = a~tbaa~1 = a~taba~! = bam?.) 


Although arbitrary elements need not commute, certain elements (such 
as 0,1) commute with every element of the ring. Define the center Cent(R) 
of aring R to be {ce € R: cr = re for all r in R}. Obviously, Cent(R) is a 
commutative ring, which is a field if R is a skew field, by Remark 0. Thus 
any skew field can be viewed as a vector space over its center. 

Generalizing Chapter 21, we say an F-algebra is a ring R containing 
an isomorphic copy of F in its center C, 1.e., there is an injection F > C. 
(There is a more abstract definition of algebra, given in Exercise 2, but this 
definition serves our purposes.) Perhaps the most familiar noncommutative 
algebra is M,(F), the algebra of n x n matrices over a field F, where the 
injection F + M,(F) sends a to the scalar matrix al. 


The Quaternion Algebra 


Definition 1. (Hamilton’s Algebra of Quaternions.) Let H denote the two- 
dimensional vector space over C, with base {1,7}; thus H = C+Cj = 
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R+Ri+Rj+ Rk, where k = i7. We define multiplication on H via the 
rules 


a —jt = k; 
gk = —ky = ty 
ki=-itk=j 
(Note: The last three lines are encompassed by rule ijk = —1. For example, 


kj = —(-Dky = -ijgkkj = -7.) 
This multiplication extends via distributivity to all of H, by 


(a4 + by2 + Cy + dik) (a2 + bot + C2] + dy k) = a3 + b3t ate C33 F d3k: 
for dy, bu,Cu,dy € R, where 


a3 = 4142 — bib — e1€2 — di ds: 
bs = ayb2 + by a2 + c1dy — die; 
€3 = 442 — bydy + c1A2 + dybo; 
dx = a,dy +. bic. — cy bo + dao. 


We have no guarantee a priori that H is an R-algebra, although it is 
not difficult to verify the axioms directly. However, it is more elegant to 
view H as a subalgebra of an algebra we already know, namely M>(C). 


PROPOSITION 2. H is a skew field. 


Proof. There is a map y:H > M2(C) given by y+ 27 BH (2 a Writing 
1, jek for the respective images of 7,7,4, we see that l= é Ey 9 = 
(as and k = ee and we have 72 = - =f? = ijk = —1. Thus 
y respects the defining relations for H and thereby preserves multiplication. 
ker » = 0 by inspection, so we conclude His a ring, and ¢ is a ring injection. 


Furthermore, let d = det ( 2 > = y¥ + 2Z, a positive real number 


whenever y or z # 0; then 


implying (y+ 2j) | =d~1(y — 27). Hence His a skew field. 0 


If it seems that we have pulled ¢ out of a hat, see Exercise 4 for a more 
systematic approach. 


Digression. H has dimension 4 as a vector space over R, whence the name 
“quaternions”; this algebra is closely related to the group of quaternions, 
which we studied earlier, cf. Exercise 18. For many years Hamilton had 
tried in vain to find a skew field of dimension 3 over R, although today it 
is rather easy to see that there is none, cf. Exercise 15. Indeed, there is a 
theorem (which we shall not prove here) that the dimension of a skew field 
over its center must be a square number. 


Remark 8. In the proof of Proposition 2 we defined a multiplicative map 
N:H\ {0} > R® given by 


N(y + 2j) = det © i = yy + 22. 
(N is multiplicative because det is multiplicative.) We can rewrite N as 


N(at+bitej+dk) =a +0? 4c? +d’, which is a sum of four squares. 


Proof of Lagrange’s Four Square Theorem 
This discussion bears directly on Lagrange’s Four Square Theorem, in 
the following manner. 


COROLLARY 4. If n, and ny are sums of four square integers, then so is 
N1zNs. 

Proof. Write n, = a2 +02 +02 +d = N(aytbuitcuj +duk) for u = 1,2. 
Then nynz = N(q), where 


q= (a1 + bi +017 + dik) (a2 + bot +027 + dok). O 


COROLLARY 5. To prove Lagrange’s Four Square Theorem it suffices to 
assume the natural number n is prime. 


Proof. By induction applied to Corollary 4. 0 


LEMMA 6. For any prime number p, there is a number n < § such that np 
is a sum of three square numbers prime to p. 


Proof. One may assume p is odd. In the field Z, the sets 


{-[a?:-2 <a< 2} and {oP +1:-b <b< 8} 


each take on at least pet distinct values, since any duplication comes at 
most in pairs. (In any field the equation x? — ¢ = 0 has at most two 


solutions.) But Z, has p elements, so —[a]? = [b]? +1 for suitable a,b, i.e., 
p divides a? +b? +1. Since a? +b? +1< 2(24)? +1 one sees n < $.0 


The proof of Lagrange’s Four Square Theorem can be concluded by 
induction on the smallest n such that np is a sum of four squares and is not 
too difficult, cf. Exercises 6 and 7. However, it is enlightening to recast the 
proof in terms of the structure of algebras, as was done in proving Fermat’s 
theorem. We shall sketch the proof, leaving the verifications as exercises. 

We need a quaternion version of the Gaussian integers Z[z]. The ob- 
vious candidate is Z+ Zi+ Zj+ Zk, where i? = j? = k? = 1 = —ijk. 
The first step then is to define a Euclidean algorithm analogous to the 


Gaussian integers, but the natural definition doesn’t quite work for this 
ring. However, one can define instead the ring of integral quaternions 
R= {a+ bitej4+ dk : 2a,2b,2c,2d € Z}, and this indeed satisfies the 
Euclidean algorithm (Exercise 8). Consequently, as in the proof of Propo- 
sition 16.11 one concludes that every left ideal is principal. Now we obtain a 
noncommutative arithmetic on R by defining an element r to be irreducible 
whenever Rr is a maximal left ideal and defining gcd(r1, 72) to be that rz 
such that Rr; + Rro = Rr3. One can conclude that an integral quaternion q 
is irreducible iff N(q) is a prime number in Z (cf. Exercise 10), from which 
it follows at once that any prime p in Z can be written as N(q), where q is 
an irreducible integral quaternion dividing p. Hence, 4p = N(2q) is a sum 
of four squares in Z, so we conclude with Exercise 6. 

Let us return to quaternion algebras. We could build a quaternion 
algebra Q starting from any field F; namely take a four-dimensional vector 
space F + Fi+ Fj + Fk with multiplication defined by the relations 7? = 
yj? =k? = -1 = ijk. One could hope for Q to be a skew field, but this 
is not the case when F = Z,. (If a? +6? +c? = np in Lemma 6, then 
ai + bj +ck is a zero-divisor in Z,.) More generally, every finite skew field 
is commutative, but this result needs some preparation. 


Polynomials over Skew Fields 

In this discussion, D denotes a skew field having center F. Note that 
x € Cent(D[z]). If f,g are polynomials, we say g divides f if f = hg 
for some h in D[z]. Also, given f = Xd;x* in D[z] write f(d) for Nd,d’, 
i.e., “right substitution” for d. (One must be careful about the side one 
substitutes, since d need not commute with the d;.) 


Remark 7. Given f € D[x] and din D, we have 
f(a) = q(2)(2 — d) + f(d) 
for some q in D[2]. In particular, f(d) = 0 iff « — d divides f(z). 


The proof follows from the Euclidean algorithm for polynomials (Propo- 
sition 16.7). Explicitly, if f = =%, dz’, then let g(x) = f-—d,x2" '(x—d), 
of degree <n; one sees by induction on n that 

9g = u(2)(@ — d) + g(d) = u(a)(a — d) + f(d), 
implying f =g+d,0""'(2—d) = (qu +dy2""')(2 — d) + f(d). 
REMARK 8. Given f = hg, let us put f = h(x)g(d); writing h = Xd;2", we 
see f = Ldjg(d)x", so f(d) = Ldjg(d)d' = f(d). 
This simple computation provides our main tool: 


PROPOSITION 9. With notation as in Remark 8, if d is a root of f but not 
of g, then g(d)dg(d)~" is a root of h. (Note that here we should assume 
D is a skew field, to insure that g(d) is invertible.) 


Proof. By Remark 8, f(d) = f(d) =0. Hence, x — d divides f = h(x)q(d): 
consequently x — g(d)dg(d)~! divides g(d)(h(a)g(d))g(d)~1 = g(d)h(x) and 
thus divides h. 0 


By a conjugate of din D we merely mean an element of the form ada~!, 
for suitable a in D. Note that if f(d) =0 for f = 0, a;2" € F[z] then 


f(ada~ = Dias ada t= =a) -aid'a = af (dja? =a0da i! = 0: 


1.e. every ee of d is also a root of f. The big difference between 
the commutative and noncommutative cases is that d is its only conjugate 
when D is commutative, whereas there can be an infinite number of distinct 
conjugates when d € D \ F. In fact, there are enough conjugates for the 
following theorem. 


THEOREM 10. If f(x) € Fla] is monic irreducible and has a root d, in D, 
then f = (x2 —-d,)...(a —d,) in D[z], where each d; is a conjugate of d,. 


Proof. Let g = 2 — d,, and write f = hg. Any element d = adja"! 4 dy 
is a root of f, but not of g, so applying Proposition 9 yields a root dz = 
(d—d,)d(d—d,)~' of h; we continue by induction on degree. To make sure 
the procedure does not break down, we require the following observation. 


Remark 11. Assumptions as in Theorem 10, if h € D[z] with deg h < deg f 
and every conjugate of d; is a root of h, then h = f. (Indeed, take a monic 
counterexample fh of minimal degree < deg f; then each conjugate ada 
of dis a root of the polynomial aha~', and thus of h — aha~!, which has 
lower degree since both h and aha~! are monic, a contradiction.) 0 


Actually, Theorem 10 is only half a theorem. There is a sort of unique- 
ness to the factorization, as follows: 


THEOREM 12. If f = (a —d,)...(% —d,) € D[z] and f(d) = 0, then d is 
a conjugate of some d;; in fact, writing f; = (a — d;)(a — dj_1)...(a — d,) 


and fy =1, one has d= f;(d)~+dj41f;(d) for some i. 


Proof. If f:(d) = 0 then d = d,. Hence, we may assume f;(d) #0. Take 7 
with f;(d) #0 and fj41(d) = 0. Letting f = fi41 and g = f; in Proposi- 
tion 9 we see f = (% — djs1)q. so g(d)dg(d)~* is a root of # — dj41. 7.e., is 
d;41 itself. Hence, d= g(d)~'d;4,9(d). 0 


Structure Theorems for Skew Fields 


THEOREM 13. (Skolem-Noether Theorem) Suppose a € D is algebraic 
over F. Any other root b € D of the minimal polynomial f(x) of a is 
conjugate to a. 


Proof. Write f = (a—d,,)...(%—d,) by means of Theorem 10, so that each 
d; is conjugate to a, and apply Theorem 12 (since f also is the minimal 
polynomial of b). 0 


Remark 14. Another way of expressing this result is as follows: If A/F is 
separable and L D F is another subfield of D that is F-isomorphic to K, 
then the isomorphism is given by conjugation by some element d of D; in 
particular, dKd~1 = L. (Indeed, by Exercise 23.3 we can write K = Fla); 
letting b € E be the image under the isomorphism, we see that a and b have 
the same minimal polynomial, so 6 = dad~! for suitable d in D.) 

It is possible to prove this result also for nonseparable extensions, but 
the proof will not be given here. 


Next, define the centralizer Cp(S) of an arbitrary subset S' of a ring 
Rto be {r € R: rs = sr for all s in S}. It is easy to check that Cp(S) is 
a subring of R, and in view of Remark 0, if D is a skew field then Cp(S) 
is a skew field. We are ready for a deep result of Wedderburn. 


THEOREM 15. (Wedderburn) Every finite skew field D is commutative. 


Proof. Let F = Cent(D), a finite field that thus has order p* for a suitable 
prime p and some ¢; let K D F be a commutative subring of D having 
maximal order. Then K is a finite integral domain and thus a field, and 
being a vector space over F, has order m = p“ for some multiple w of ¢. 
Supposing D noncommutative, we see K # F, since F[d] is commutative 
for any din D. By Theorem 26.8, K/F is cyclic Galois, implying by the 
Galois correspondence that there is an intermediate field F C EC K such 
that [Kk : L] = p. 


Let D' = Cp(L). By Theorem 13, the nontrivial automorphism of K/ I 
is given by conjugation by an element d of D; clearly, d € D', implying D! 
is not commutative. Let n = |D'|. A fortiori, K is a maximal commutative 
subring of D'; since |L| = p*~', we see L = Cent(D'). Any commutative 
subring A’ of D! properly containing L must also have order precisely 
p“ =m. (Indeed, K’ is a field extension of L, so [K' : L] > p, implying 
|K'| > m > |K| so by hypothesis |K’| = m.) 

Now viewing H = K \ {0} as a subgroup of the group G = D’ \ {0}, 
ae 
But each subgroup contains the element 1, so the number of conjugates 


uu 


we see that the number of subgroups conjugate to H is at most 


of elements of H in D!' is at most (m= 3)in= 3) +1< n-—1, implying some 
element a of D’ is not conjugate to any element of K. 

Let K’ = Fla]. As noted above, K' is a field of order m. But any 
extension of L of order m is a splitting field of the polynomial 2” — z 
over Z, and thus over L, and hence they all are isomorphic over L; thus 
K and K' are conjugate in D! by Theorem 13, contrary to the assumption 
that a is not conjugate to any element of K.O 


Are there skew fields other than the quaternions? It took algebraists 
another 50 years to produce a skew field other than Hamilton’s. In fact, 
Frobenius proved H is the only noncommutative skew field whose elements 
all are algebraic over R, cf. Exercise 15. 

If one is willing to consider rings with zero-divisors, one has matrix 
rings; then one can generalize parts of Proposition 2 and Remark 3, cf. Ex- 
ercise 12. Actually, another glance at Example 2 reveals the key role played 
by matrices, and, indeed, rings of matrices are the focus of noncommutative 
algebra. But that starts another tale that must be told elsewhere. 


Exercises 


1. M,(F) is an F-algebra, where we identify F with the ring of scalar 
matrices (cf. Exercise 13.1). 

2. Define an F-algebra to be a F-vector space R that also is a ring 
satisfying the following property, for all a in F and r; in R (where 
multiplication is taken to be the ring multiplication or the scalar 
multiplication of the vector space, according to its context): 


a(r1r2) = (ary )rg =11 (are). 
Define the ring homomorphism yg: F > R given by aH a-1. Then 


F-1 is a subfield of Cent(R), so we have arrived at the definition 
in the text. 


10. 


11. 


This definition is very useful in generalizing to algebras over 
arbitrary commutative rings, not necessarily fields. 
Show that the regular representation (Exercise 21.15) also works for 
noncommutative algebras. 
Derive Proposition 2 via the regular representation, viewing H as 
the vector space C+ Cj and considering the matrices corresponding 
to right multiplication by 7,7, / respectively. 
The map”: H > H given by a+ bitej +dkwH a— bi — cj — dk 
is an anti-automorphism (i.e., reversing the order of multiplication) 
of order 2. (Hint: N(q) = qq.) 


Proof of Lagrange’s Four Square Theorem 

The following exercises provide two proofs of Lagrange’s Four 
Square theorem. 
If 2n = a? + 6? 4+ c? + d?, then rearranging the summands so that 
a = b (mod 2) and ¢ = d (mod 2) one can write n as the sum 
(phy + (ashy + (S$) + (8). 
(Direct computational proof of Lagrange’s Four Square Theorem.) 
Suppose p is not a sum of four squares. Take n > 1 minimal, such 
that np = a? +6? +c? + d& is a sum of four squares; n is odd 
by Exercise 6. Take the respective residues a’,b',c',d' of a,b,c,d 
(mod n), each of absolute value < $. Then there exists m such 
that nm = (a')? + (b')? + (c')? + (d')? < n?; hence, m <n. Since 
N(a+bit+cj+dk) = np, note that (a+bi+cj+dk)(a'—b'i—c!j—d'k) is 
a quaternion each of whose coefficients is congruent to 0 (mod n); 
dividing through by n yields a quaternion whose norm is < np, 
contradiction. 
Verify the Euclidean algorithm for the ring of integral quaternions. 
(Hint: Use a similar trick to that used for the Gaussian integers.) 
Any natural number m > 1 is reducible as a quaternion. (Hint: 
Assume that m is an odd prime. Then m divides 1+ a? + b? for 
suitable integers a,b prime to m. Let q=1+ai+bj7 and d= g.c.d. 
(m,q) = 11m+req. N(d—11m) = N(roq) = N(r2)(1+ 07 +87) is a 
multiple of m, implying m|N(d). In particular, d is not invertible, 
and Rm C Rd since q € Rd). 
Prove an integral quaternion r is irreducible iff N(7) is prime. (Hint: 
Take a prime divisor p of N(r) in Z, and let d = g.c.d.(p,7). Then 
p= N(d)=N(r),) 
Suppose F is a field containing a primitive nth root p of 1, and 
let V be the n?-dimensional vector space with base denoted by 
{y%2" :0< u,v <n}, made into a ring by the relations y” = 1 = z” 


12. 


13. 
14. 
15. 


16. 


17. 


18. 


19: 


and zy = pyz. Prove that this is an algebra with center F, and has 
no proper ideals + 0. 

Using the regular representation, view V of Exercise 11 as a subring 
of M,,(F[y]), by means of the determinant, and define the norm as 
in Remark 3. Now prove that if t; and t) each are sums of n? 
nth powers of integers, then so is ty¢.. 


Frobenius’ Theorem The next three exercises comprise a quick 
proof of an interesting result of Frobenius. 

Cr(K) = K, for any maximal commutative subring K of a ring R. 
If $,T C R, then Cr(S UT) = Cr($) 1 Cr(T). 

(Frobenius’ Theorem.) The only skew fields D algebraic over R are 
R,C, and H. (Hint: Suppose D# R,C. Any commutative subring of 
D properly containing the center F must be isomorphic to C. Thus 
F =R. Take a maximal commutative subring C = F[2], where 
7 =—1. Claim: Any element a in D\ C is contained in a copy of H 
that also contains C. Indeed, Fla] ~ C, and thus F[a] = F[j] with 
jvo=—l1. Let e=i7 4+ 71. Then ¢ commutes with both i and j, and 
thus by Exercise 14 is in F. This proves that D' = F+ Fi+ Fj+Fij 
is a subring of D. Now let b = 17 —ji € D'. Then ib = —bi; replacing 
j by b enables us to assume 77 = —j2, and it follows readily that 
D!' = HH, as claimed. 

If D D H then as above, D contains another subring D” = 
P+ Fi+ Fj' + Fij’ » H, where ij’ = —j'i and j/” = —1; hence, 
j'j7-* commutes with 7, implying j’j~! € F[z], or 3! € Fli]gj C D’, 
contrary to D" # D'.) 

Suppose FR is a ring containing a field F, andn =[R: F] < ow. 
Then R is algebraic over F, in the sense that any r in R satisfies a 
nontrivial equation D7, air = 0 for a; in F. 

Given any group G and field F, define the group algebra F[G] 
that as vector space over F has base G, and multiplication of the 
base elements is given by multiplication in G. Show that 7(G) C 
Cent(F[G]). 

Let Q denote the quaternion group (with relations a4 = 1 = b4; 
bab? = a~+; a? = b?). Show that H contains a quaternion group 
(generated by 7,7). Moreover, there is an algebra homomorphism 
R[Q] — H given by aH i, b& 3; the kernel is (a? + 1). 
Generalizing Exercise 17, define the monoid algebra F[M] for any 
monoid M. Show that F[z] can be written as a monoid algebra. 
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group, 191, 195££, 198, 201, 208ff, 211££ 
of polynomial, 201, 211ff 
solvable, 208 
Galois’ Theorem on solvable extensions, 213 
Gauss’ Lemma, 147 
Gaussian integers, 139ff 144, 227 
gcd, 5, 100, 119, 124, 129, 146, 227, 


199, 


208 


of polynomials, 131, 182 
general linear group, see group 
golden ratio, 167 
greatest common divisor, 119, see also gcd 
group, 2ff 
Abelian, 2, 13, 17, 22, 41, 46ff, 86, 208 
finitely generated, 47, 54f 
acting on a set, 75 
action, 75 , see also action 
trivial, 76 
algebra, 232 
alternating, 36ff, 37ff, 83, 89, 211 
automorphism of, 89 
of automorphisms 
of a group, 89, 92 
of a field, 191ff 
classical, 20, 22 
commutator, 86, 92 
cyclic, 39, 53, 75, 134f, 208 
dihedral, 59, 63f, 68, 69, 71, 76, 85, 89, 191 
infinite 62, 64 
Euler, 5, see also Euler 
factor, 31, see also group, residue 
finitely generated, 39 
free, 62 64f 
Galois, see Galois 
general linear, 4, 9, 20ff, 27, 90, 
generated by S, 40 
generators of, 40, 56, 65, 
homomorphism, see homomorphism 
Klein, 34, 42 
linear algebraic, 20 
multiplication table, 23, 27 
nilpotent, 89f 
of class 2, 89, 92 
of class t, 89 
of symmetries 63 
order of, see order 
orthogonal, 21, 27 
p-, 48ff, 74, 88, 90, 
Abelian, 52 
presentation of, 56 
projective linear, 91 
quaternion, 61, 64, 67, 76, 85, 232 
quotient, 31 
representation 70, see also representation 
residue, 31ff 
simple, 83, 91f 
solvable, 87f, 207, 213 
special linear, 21, 22, 27, 90f 
special orthogonal, 21 
symmetric, 4,17ff, 29£,34ff£,37£f, 45, 66ff, 71, 
symplectic, 21 
trivial, 3 
groups 
of odd order, 88 
of order 8, 61 


72, 


76, 


93, 


191, 


of order <16, 70 
of order <60, 82, 84f, 88 
of order Pp, 76, 93 


H 
Hamilton’s algebra of quaternions, 224, see also quaternions 
Hilbert Basis Theorem, 132 
homomorphic image, 26, see also image 
homomorphism, 
F-, 176 
canonical, 32 
of groups, 24ff 
of rings, 103 
ring 175 


I 
ideal 104ff, 107 
finitely generated, 132 
left, 99ff 
nontrivial, 104ff 
principal, 100 
proper, 99f 
maximal, 106 126, 128 
prime, 125, see also prime 
principal, 132 


right, 99 
trivial, 104 
ideals 


of R[S‘], 113 

sum of, 102, 105 
idempotent, 101 
identity, 2 
PEE AK 
image, 26 
injection, 25, 67, 175 
integral domain, see domain 
intermediate field, 200 
invariant, 25 
inverse, 2 
invertible, 2 

left 2 
irreducible, 117ff, 124, 151, 231 
isomorphism, 33 

F- 176f£ 

of groups 24 


K 
Kantor’s diagonalization, 165 
kernel, 25, 29, 32f, 68, 102, 105f 


L 
Lagrange, 10 
resolvent, 209 
Lagrange’s Theorem, 11f 
(four squares) 224, 226f, 231 
lem, 131 


Liouville’s number, 164, 219 


M 
Maclaurin’s expansion, 121 
map, viii 
matrices xi, 3 
matrix 
elementary, 90 
ring, 101, 104, 107, 230 
unit, xi 
upper unipotent, 84 
Mobius 
function, 14 
inversion formula 14 
module 
G- 78 
monoid, 2, 3f, 44 
algebra, 232 
cancellative, 4 
commutative, 2 
finite, 2 
free, 62, 64 
of principal ideals, 129 
opposite, 71 
multiplication map, left, 5 


N 
Nagata’s theorem, 151 
neutral element, 2 
Newton’s formulas, 153 
nilpotent, 131 
Noether’s Isomorphism Theorems 
for groups, 32ff 
for rings, 106 
norm, 216 
normal 
closure, 200f, 204f 
extension 195, 197 
normalizer 79, 84 


0) 
orbit 75 
order, 2 
of element, 10 15, 19, 28, 41, 48ff, 53 
of Galois group, 195ff 
of GL(n,F), 6f 
of group, 2, 11, 15f, 25, 52, 58ff, 69, 80 
of orthogonal group, 27 
of PSL(n,F), 91 
of SL(n,F), 27 
of subgroup, 20, 26, 30, 37, 53, 80 
P 


p-group 48, see also group, p- 

p-Sylow subgroup 49, see also group, p-Sylow 
parentheses, placement of, 34 

partition, G-, 77 


number, 55, 122 
permutation, 17, 38 

even, 35 

odd, 35 

sign of 35, 37 
perpendicular, 168 
PID 124, 126ff, 130 
pigeonhole principle, 5 
polygon, see regular 
polynomial, 176ff, 220 

constant, 115 

content of, 146ff 

cyclotomic, 193 

inseparable, 181, 185f 

irreducible, 146ff,148,149ff,160,176,179,185,187,189,190,212,see a 

irreducible 
minimal 159, 176, 184 
monic, 115, 166 


resultant 131 
ring 114, 132, 133, 147ff, 159ff 
in several indeterminates, 151 
root of, 176f 
satisfied by a, 159 
separable, 181ff, 185f, 191, 215 
symmetric, 152f 
elementary, 152, 184 
power series, formal, 120ff, 151 
prime 
element, 118, 124, 125f, 132, 151 
ideal, 125f, 126, 131, 132 
number, 138ff 
relatively, 125 
primitive root of 1, 135ff, 141ff, 145, 162, 178, 193, 207f, 213, 216, 
Primitive Root Theorem, 204 
principal ideal, 124 
domain, see PID 
projection, 41 
Pythagorean triple, 140, 144f 
Pythagorus, 140 


Q 

quadratic extension, 162 
formula, 155 

quadratically defined, 202 
field, 170ff 

quartic equation, 156f 

quaternion algebra, 224ff, 227, 231 
group, see group, quaternion 


R 
rank of Abelian group, 55 
reduction modulo p, 190 
regular 
n-gon, 169, 173, 174, 202 
representation, 166 
relatively prime, 119 


polynomials, 182 
representation, 70 
of degree n, 70 
of groups, 70, 78 
regular, 70 
of rings, 107, 120, 231, 232 
restriction, 26 
ring, 98ff 
with ACC, 127 
circle, 151 
commutative, 98, 107 
division 98, see also field, skew 
finite, 99 
intermediate, 110 
Noetherian, 127ff 132 
of integral quaternions, 227 
of matrices, see matrix ring 
polynomial, 114ff , see also polynomial 
residue, 105ff 
trivial, 98 
root, see polynomial, root 
extension, 207 
strong, 216 
of 1, primitive see primitive 
of polynomial, 134 
real, 212, 214 
Theorem, Primitive, 183 
tower, 207, 213f 
Ruffini, 157 


S 
semidirect product, 60, 64, 65, 92f 
internal, 93 
semigroup, 7 
inverse, 7 
separability degree, 205 
separable element, 183 
extension, 183, 195, 200, 205 
polynomial, see polynomial, separable 
skew field, see field, skew 
Skolem-Noether Theorem, 229 
solvable 
by radicals, 207ff 
strongly, 216 
group, see group 
splits, 177 
splitting field, 177££, 187, 191ff, 210, 
squaring the circle, 168 
stabilizer, 75 
Steinitz’ Theorem, 184 
subdirect product, 215 
subfield, 112, 187 
characteristic, 187 
fixed, 192 
of GF(pt), 190 
subgroup, 8ff, 13, 26f, 33, 36 
characteristic, 89, 92 


commutator, 86 
of S,, 86 
eyclic; -9 
derived, 87, 92 
generated by S, 64 
multiplicative, of field 134,187 
normal, 29, 36, 69, 81, 82f, 87, 92 
p-, 80 
p-Sylow, 49, 80, 84 
proper, 8 
torsion, 55 
trivial, 9 
subgroups 
Of “Sip Lt 
Abelian, 53f 
Of Zn, 27 
of the Galois group, 197f 
product of, 19f, 21 
submonoid, 8 
subnormal series, 87, 88 
factor of, 87 
subring, 99, 104, 110f 
subrings of Q, 112f 
Substitution lemma, 133 
surjection, 25 
Sylow’s Theorem, 80 
for Abelian groups, 50 


T 
trace, 216 
transcendence 

of e, 219ff 

of -piy. 222E 
transcendental 

element, 159 

number, 159, 165, 219ff 
transitive, 215 
transposition, 18, 35, 37f 
trisecting an angle, 168 


U 

union, viii 

UFD, 119ff, 123, 128, 132, 146ff, 151 

unique factorization domain, 119, see also UFD 
unit element, 98 

universal, 101 


V 
Vandermonde determinant 152, 153 
vector space, 160 


W 
Wedderburn’s Theorem, 229 
Wiles, 140 


Wilson’s Theorem, 13, 22 
wreath product, 93 


Z 
Zorn’s Lemma, 129, 130 


Some of the most famous questions in mathematical history have 
involved equations with coefficients in Z, the set of integers. This 
text deals with their solutions which are achieved with an 
economy of effort through the process of abstraction. This 
deduction of sweeping conclusions from elementary premises is 
at the heart of “abstract algebra,” and Rowen expertly uses this 
approach fo initiate the intelligent student to the “glorious world 
of mathematical discovery.” Algebra presents both results and 
their underlying ideas thus introducing the basic concepts of 
groups, rings, and fields to solve long-standing important 
problems in mathematics. Among the subjects covered are 
results from number theory, the fundamental theory of algebra, 
the unconstructability of certain numbers, the characterization of 
all finite fields, and the unsolvability of the general equation of 
degree 5. Abandoning the encyclopedic techniques of most 
algebra texts, Rowen embarks on recent frontiers in his field 
including new, never-before-published material on noncom- 
mutative algebra. This book will surely instill in the student the 
supreme elegance of this level of abstract algebra. 
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